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Abstract 

The most promising approaches for efficient detection in multiple-input multiple-output (MIMO) 
wireless systems are based on sphere-decoding (SD). The conventional (and optimum) norm that is used 
to conduct the tree traversal step in SD is the /^-norm. It was, however, recently observed that using the 
Z°°-norm instead reduces the hardware complexity of SD considerably at only a marginal performance 
loss. These savings result from a reduction in the length of the critical path in the circuit and the 
silicon area required for metric computation, but are also, as observed previously through simulation 
results, a consequence of a reduction in the computational (i.e., algorithmic) complexity. The aim of 
this paper is an analytical performance and computational complexity analysis of /°°-norm SD. For 
i.i.d. Rayleigh fading MLMO channels, we show that /°°-norm SD achieves full diversity order with an 
asymptotic SNR gap, compared to /^-norm SD, that increases at most linearly in the number of receive 
antennas. Moreover, we provide a closed-form expression for the computational complexity of ^°°-norm 
SD based on which we establish that its complexity scales exponentially in the system size. Finally, we 
characterize the tree pruning behavior of Z°°-norm SD and show that it behaves fundamentally different 
from that of /^-norm SD. 
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I. Introduction 

Multiple-input multiple-output (MIMO) wireless systems offer considerable gains over single- 
antenna systems, in terms of throughput and link reliability, see, e.g., [1]. These gains come, 
however, at a significant increase in receiver complexity. In particular, one of the most challenging 
problems in MIMO receiver design is the development of hardware-efficient data detection 
algorithms achieving (close-to) optimum performance [2]. Among the most promising approaches 
to the solution of this problem is the so-called sphere-decoding (SD) algorithm [3]-[8], which 
performs optimum, i.e., maximum-likelihood (ML), detection through a weighted tree search. 
SD exhibits (often significantly) smaller computational complexity than exhaustive search ML 
detection [2], [9]. 

A. Hardware Implementation Aspects of SD 

Hardware implementations of several variants of the SD algorithm are described in [2], [7]. It 
is argued in [7] that the overall hardware complexity of a SD is essentially determined by (i) the 
computational (i.e., algorithmic) complexity in terms of the number of nodes visited in the tree 
search and (ii) the circuit complexity in terms of the length of the critical path in the circuit and 
the required silicon area for metric computation. The length of the critical path limits the clock 
frequency of the circuit [10]. One of the main findings of [7] is that replacing the /^-norm in the 
ML detector by the /°°-norm and hence traversing the search tree based on the /°°-metric incurs 
only a small performance loss while significantly reducing the overall hardware complexity of 
SD by virtue of a reduction of both the computational and the circuit complexity. 

To understand where the reduction in circuit complexity comes from, we refer to Fig. [T] (cf., 
[7, Fig. 2]) showing tradeoff curves between circuit area and the length of the critical path 
corresponding to the computation of the metrics x\ + (squared /^-norm) and max{|xi|, |x2|} 
(/°°-norm) for Xi,X2 G M. These tradeoffs can be achieved by choosing different hardware 
implementations of the corresponding metric computation circuit. From Fig. [T] it can be seen 
that the computation of max{|xi|, |x2|} can be implemented much more efficiently in hardware 
than the computation of + x^. The main reason for this is that evaluating max{|xi|, \x2\], 
in contrast to x\ + x^, does not require squaring operations. Replacing the P- by the Z°°-norm 
also has an impact on the computational complexity of SD. In particular, it was observed in [7], 
through simulation results, that SD based on the Z°°-norm (referred to as SD-/°°) exhibits lower 
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Fig. 1. Circuit area and critical pathi length tradeoff curves corresponding to the computation of xl + X2 (squared /^-norm) 
and max{|a;i|, \x2\} (/°°-norm) for x-i_,X2 G M. The area is given in gate-equivalents (GE) and the length of the critical path is 
given in nano seconds (ns). W denotes the word length. 



computational complexity than SD based on the /^-norm (referred to as SD-/^). Furthermore, 
the results in [7] indicate that the overall complexity (determined by both the circuit and the 
computational complexity) of SD-/°° is up to a factor of 5 lower than the overall complexity of 
SD-P. SD-/°° therefore appears to be a promising approach to near-optimum MIMO detection 
at low hardware complexity. 

B. Contributions 

The aim of this paper is to deepen the understanding of SD-/°° through an analytical perfor- 
mance and computational complexity analysis for i.i.d. Rayleigh fading MIMO channels. Our 
main contributions can be summarized as follows: 

• We show that SD-/°° achieves the same (i.e., full) diversity order as SD-/^. 

• We show that the gap in signal-to-noise ratio (SNR) incurred by SD-/°°, compared to SD-/^, 
increases at most linearly in the number of receive antennas. 

• We derive a closed-form expression for the complexity of SD-/°^. Here and in the remainder 
of the paper, complexity is defined as the average number of nodes visited in the tree search, 
where averaging is performed with respect to the (random) channel, noise, and transmit 
signal. Corresponding results for SD-/^ can be found in [9], [11]-[13]. 

• We prove that the complexity of SD-/°° scales exponentially in the number of transmit 
antennas. Our proof technique directly extends to SD-/^ and thus yields an alternative (vis- 
a-vis [14]) proof of the exponential complexity scaling behavior of SD-Z^. 
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• Finally, we provide insights into the tree pruning behavior of SD-/°° relative to that of SD- 
l'^. In particular, based on an asymptotic (in SNR) analysis of our closed-form complexity 
expressions, we show that SD-/°° tends to prune more aggressively than SD-/^ at tree levels 
closer to the root of the search tree, whereas this behavior is reversed at tree levels closer 
to the leaves. 

C. Outline 

The paper is organized as follows. After introducing the system model and briefly reviewing 
relevant aspects of SD-/^ and SD-/°° in the remainder of Section |l| we analyze the error 
probability behavior of SD-Z°° in terms of diversity order and SNR gap in Section |llj In Section 
[Till we derive a closed-form expression for the complexity of SD-/°°. This result is then used to 
establish the exponential complexity scaling behavior (in the number of transmit antennas) of SD- 
and to analyze, in Section |W} the tree pruning behavior of SD-/°° by means of an asymptotic 
(in SNR) analysis. In Section |V} we report modifications of the results presented in Section [ll] 
and Section [In] to account for the slightly modified metric used in the hardware implementations 



reported in [7]. Numerical results are provided in Section VI Section Vn concludes the paper. 



D. Notation 

We write j for the entry in the ith row and jth column of the matrix A and Xj for the iih. 
entry of the vector x. For unitary A, we have A^A = AA^ = I, where ^ denotes conjugate 
transposition, i.e., transposition ^ followed by element-wise complex conjugation *, and I is 
the identity matrix. The and the /°°-norm of a vector x = (xi ■ ■ ■xm)'^ € C*^ are defined 
as ||x||2 = a/|xiP + ■ ■ ■ + |xmP and ||x||oo = max{|a;i|, . . . , |xm|}, respectively. We will also 
need the /°°-norm ||x||55 = max{|a;R,i|, |xi,i|, . . . , |xi,a/|}, where xr and xi denote the real and 
imaginary parts, respectively, of x G C. We note that the /^-norm is invariant with respect 
to (w.r.t.) unitary transformations, i.e., ||x||2 = ||Ax||2 if A is unitary. ]E{-} stands for the 
expectation operator and = Eje"*^} refers to the moment generating function (MGF) of 

the random variable (RV) x. We write x ~ if the RV x is x-distributed with a > degrees 
of freedom and normalized such that E{x^} = a. The probability density function (pdf) of the 
RV X ~ Xa is then given by [15] 

2l-a/2 2 

^ ' r(a/2) ' - ^ ' 
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and f.j.(t) = 0, t < 0, where r(a) = y"-~'^e~'^dy refers to the Gamma function. For the 
corresponding cumulative distribution function (cdf) we have P\x < t\ = 70/2(^^/2). Here, 
7a (t) denotes the (regularized) lower incomplete Gamma function; some important properties 
of 7a (i) are stated in Appendix |C} We denote a circularly symmetric complex Gaussian RV 
with variance cr^ as a; ~ CJ\f{0, cr^); x ~ M{fJ.x, (^1) refers to a real- valued Gaussian distributed 
RV X with mean Hx and variance a^. For independently and identically distributed (i.i.d.) RVs 

Xi ~ A/'(0, 1), i = 1, . . . , a, we have z = \/x\^ V x\ ~ Xa- Furthermore, if the RV x is Xa- 

distributed, x^ is Xa -distributed. We write y ~ Xa if ^ V is Xa'distributed with E{?/} = a. 
In particular, the MGF of the RV ?/ ~ Xa is given by 

^y{s) = (l-2s)-"/2 (2) 

for any s < 1/2. The Q-function is defined as Q{x) = (l/v^27r) J^°° e'^^/^dy, for a; > 0. 
For equality in distribution we write =. Furthermore, the "Big O" notation g{x) = 0{f{x)), 
denotes that \g{x)/ f{x)\ remains bounded as x Xq [16]. The "little o" notation 
g{x) = o{f{x)), X xo, stands for lim^^^Q = 0, and g{x) ~ f{x), x xq, 

means that linix^xo 9i^)/ fi^) = 1- By g{x) ^ fix), x — > Xq, and g{x) y f{x), x —> xq, for 
positive functions g{x) and f{x), we denote limj-^a-g g{x)/ f{x) < 1 and Xmix^xo g{x)/ f{x) > 1, 
respectively. The Dirac delta function is referred to as 5{x), convolution is denoted as *, and 
the natural logarithm to the base e is referred to as log(-). The summations in and Xlx^x' 
are over all possible values of x and over all possible values of x except for x', respectively. 
Finally, (x) refers to the nth derivative of the function f(x) and f'{x) = f^^\x). 

E. System Model 

We consider an A^xM MIMO system with M transmit antennas and N > M receive antennas. 
The corresponding complex-baseband input-output relation is given by 

r = Hd' + w 

where d' = {d[ ■ ■ ■ d'jy.j)'^ denotes the transmitted data vector, H is the N x M channel matrix, 
r = (ri ■ ■ ■ tat)^ is the received vector, and w = {wi ■ ■ ■ wtv) '^denotes the additive noise vector. 
The symbols d'^, drawn from a finite alphabet A, have zero-mean and unit variance. Furthermore, 
we assume that the Hn,rn are i.i.d. CAf{0, 1/M) and the Wn are i.i.d. CAf{0, a^). The SNR (per 
receive antenna) is therefore given by p = l/a^. 
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F. Sphere-Decoding 

We now briefly review SD based on the /^-norm [3]-[6], [8] and (suboptimum) SD based on 
the /°°-norm [7]. 

1) SD based on the l^-norm: SD-P performs ML detection by finding 



arg mm ||r 



mil 



(3) 



through a tree search subject to a sphere constraint (SC), which amounts to considering only 
those data vectors d that satisfy ||r — Hd||2 < C| (known as the Fincke-Pohst [3] strategy). 
Here, the radius C2 has to be chosen sufficiently large for the corresponding search sphere to 
contain at least one data vector. Note, however, that if C2 is chosen too large, too many points 
will satisfy the SC and the complexity of SD-/^ will be high (for guidelines on how to choose 
C2 see [9], [17] and Section III-E[ ). The SC is then cast into a weighted tree search problem by 
first performing a QR-decomposition of H resulting in 



H = Q 



R 




where Q is an x unitary matrix, R is an M x M upper triangular matrix, and denotes an 
all-zeros matrix of size (A^ — M)xM. Then, the SC can equivalently be written as 



where 



z(d)=y 



R 

L J 



|z(d)||^ < Cl 



d with y = Q^r 



R 

J 



d' + n. 



(4) 



(5) 



Here, the unitarity of Q implies that n = Q^w is again i.i.d. CA/'(0,(t^). The data subvectors 
dfc G of length k 

dfc = {dM-k+i ■ ■ ■ dM)^, k = l,...,M, 

can be arranged in a tree with root above level k = 1 and corresponding leaves at level k = M; 
a specific d^ is associated with a node in this tree at level k. Let us define 



Zfe(dfc) 



Rfc 




dfc 



as the vector containing the bottom k + L with L = N — M elements of z(d) in ([5]). 
Here, R^ denotes the k x k upper triangular submatrix of R associated with d^ and 
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Yfc = ivM-k+i ■■■ Vm Vm+i ■ ■ ■ VnV- The metric ||z(d)||2 = ||zM(dM)||i can then be computed 
recursively according to 



|zfc(d 



k)\\2 



|zfc-i(dfc_i)||2 + |[z(d)]M-fe+i| , k = l,...,M, 



where 



(d)] 



A/-fc+l 



M 



UM-k+l — RM~k+l,i di 

i=M-k+l 



(6) 



(7) 



Thus, with ([6]), a necessary condition for d to satisfy the SC (|4]) is that any associated d^ satisfies 
the partial SC (PSC) 

(8) 



|Zfc(dfc)ll2 



Consequently, we can find all data vectors inside the search sphere, i.e., all data vectors satisfying 
the SC (|4]), through a weighted tree search. The tree is traversed starting at level A; = 1. If the 
PSC is violated by a given d^, the node associated with that d^. along with all its children is 
pruned from the tree. The ML solution (|3]) is found by choosing, among all surviving leaf nodes 
d = dpi, the one with minimum ||z(d)||2. 

2) SD based on the l°°-norm: We define SD-/°° as the algorithm obtained by replacing the 
SC Q by the box constraint (BC) ||z(d)||oo < Coo- The metric ||z(d)||oo can be computed 
recursively according to ||zfe(dfc)||oo = max{ ||zfc_i(dfe-i)||oo, | [z(d)]M-fc+i| }• Consequently, the 
PSC is replaced by the partial box constraint (PBC) 



|zfc(d, 



fcJ oo 



(9) 



If the PBC is violated by a given d^, the node associated with that d^ along with all its children 
is pruned from the tree. The Z°°-optimal solution is obtained by choosing, among all surviving 
leaf nodes d = d^/, the one with minimum ||z(d)||oo, i.e.. 



arg min ||z(d)||c 



(10) 



Slightly abusing terminology, we call the side length Coo of the search box the "radius" associated 
with SD-Z°°. Like in the SD-/^ case with C2, here the radius Coo has to be chosen large enough 
to ensure that at least one data vector is found by the algorithm. Again, however, choosing Coo 
too large will in general result in a high complexity of SD-/°° (for guidelines on how to choose 



Coo we refer to Section III-E) 
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We emphasize the following aspects of SD-/°°: 

• The SD-/°° hardware implementation reported in [7] is actually based on the 
norm ||x||55 = max{|xR,i|, |a;i,i|, . . . , |a;i,A./|} rather than the /°°-norm ||x||oo = 
max||xi|, . . . , x G C^^. Here, the essential aspect is that the computation of the 

/°°-norm, as opposed to the and the /^-norm, does not require squaring operations, 



which, as already noted in Section I-A results in significantly smaller circuit complexity. 
Nevertheless, in the following, for the sake of simplicity of exposition, we first analyze SD- 
l°°, i.e., SD based on the conventional /°°-norm, thereby revealing the fundamental aspects 
(w.r.t. performance and complexity) of SD using the /°°-norm (referred to as SD-/°°). The 
modifications of the results on SD-l°° needed to account for the use of the /°°-norm are 
described in Section |Vl 

• The tree search strategy underlying SD-/°° is identical to that of Kannan's strategy (see, e.g., 
[5], [18]), which also finds all data vectors inside a hypercube. The difference between SD- 

and Kannan's algorithm lies in calculating the final detection result. SD-/°° implements 
( fTO] ) while Kannan's approach is optimum as it implements ([3]). Optimality of Kannan's 
algorithm is achieved through (i) guaranteeing that the solution of (|3]) is contained inside 
the search hypercube (which, in general, necessitates choosing the search radius to be larger 
than the corresponding radius for SD-/°° and hence incurs a higher complexity) and (ii) in 
the last step comparing all found data vectors with respect to their /^-distance ||r — Hd||2 
(which, in contrast to SD-/°°, necessitates squaring operations). 

• Finally, we emphasize that SD-/°° as defined above does not correspond to /°°-norm 
decoding on the "full" channel matrix H according to 

doo,fuii = arg min ||r - Hd||oo (11) 

since ||r — Hd||oo 7^ ||z(d)||oo, in general. This statement also holds true for SD based on 
the Z°°-norm. In the Z^-norm case detection on the full channel matrix H is equivalent to 
detection on the upper triangular matrix R. 

II. Error Probability of SD-Z°° 

In this section, we show that SD-/°° achieves the same diversity order as ML (i.e., SD-/^) 
detection and we quantify the SNR loss incurred by SD-/°°. 
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A. Distance Properties 

We start by investigating distance properties of the SD-/°° solution doo- Using the bounds 

^||x||2<||x||^<||x||^, (12) 

valid for any vector x G C^, we obtain 



r-Hd. 



R 

L 



R 

L 



< N 



R 

J 



< N 



R 

L J 



iV r-Hd 



ML 



(13) 



We are therefore guaranteed that ||r — Hdoo||2 lies within a factor of vA^ of the minimum 
distance ||r — HdMLjlg realized by the ML detector Q. Trivially, SD-/°° is optimum for = 1 
(simply because the Z°°-norm equals the Z^-norm in this case). For increasing A^, ( [T3] ) suggests 
an increasing performance loss incurred by SD-/°° when compared to the ML detector (i.e., 
SD-P). In the next section, we quantify this performance loss in terms of diversity order and 
SNR gap. We note that for Babai's nearest plane algorithm [19] (which can be interpreted as a 
decision-feedback detector in combination with LLL lattice reduction, see, e.g., [5]), we get a 
result that is structurally similar to ( [T3| ) when doo is replaced by Babai's detection result and the 
factor A^ is replaced by 2*^^^^^ Consequently, the performance loss incurred by Babai's nearest 
plane algorithm can be expected to be significantly larger than that incurred by SD-/°°. 



B. Diversity Order and SNR Gap 

We denote the error probability as a function of SNR p as P(p). In the following, we will only 
encounter error probabilities of the form P(p) = {Kp)^^ + o{p^^), p ^ oo, with some constant 
K > not depending on p. We can define the corresponding SNR exponent 5 as [20], [21] 

S = -lim (14) 

oo log p 

Furthermore, if Pi(p) and P2(p) have the same SNR exponent, we can define an asymptotic 
SNR gap a via Pi(p) ~ P2(a;p), p ^ oo. For example, if Pi(p) = {Ki p)^^ + o{p^^) and 
P2(p) = (-^2 p) '^ + o(p^'^), we have a = K1/K2. Our analysis corresponds to multiplexing gain 
r = in the framework of [21]. The corresponding results bear practical significance as it can be 
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shown, for example, that even for r = conventional suboptimum detection schemes like linear 
equalization-based or V-BLAST detectors are unable to achieve the full diversity order of 
[1], [21], [22]. In the following, we first focus on the behavior of the pairwise error probability 
(PEP) and then analyze the total error probability. 

1 ) Pairwise Error Probability: Assume that the data vector d' was transmitted. The probability 
of erroneously deciding in favor of another data vector d 7^ d' is denoted as Pd'^d,ML(/o) in the 
SD-/^ case and Pd'^d,cxD(p) in the SD-/°° case. To derive (an upper bound on) Pd'^d,oo(p), we 
first present a somewhat unconventional approach for upper-bounding Pd'^d,ML(p), which lends 
itself nicely to an extension to the /°°-case. We start from 



Pd'^d,ML(p) = P 
= P 



|r-Hd||2 < ||r-Hd'||2 
iHb + wlU < llwlU 



with the error (difference) vector b = d' — d. Applying the inverse triangle inequality according 
to ||Hb + w||2> I ||Hb||2 — ||w||2|, we further obtain 

1, 



Pd'^d,ML(p) < P 



|w||2 > -||Hb||2 



(15) 



noting that \x\ > x, for all x G M. With ^||w||2 ~ X2N, conditioning on H, and applying the 
Chemoff upper bound yields 

1 



|w||2 > -||Hb||2 



H 



||Hb||^ 



for s e [0, 1/2). Here, ^^iJ-'^^ denotes the MGF of a X2Af-distributed RV (see ([2])). Averaging 
over H then results in 



Pd'^d,ML(p) <$;,!>) Eh^-^" 



sp 



2M 



(16) 



because 2M||Hb| 



Ibll ~ 



for ^ given b. For high SNR, the right hand side (RHS) of 



16]) is minimized for s = 1/4, which gives 



Pd'^d,ML(p) < 2^ 1 + p 



2\-N 



8M 



(17) 



Since is the maximum diversity order that can be achieved over an A^ x M MIMO channel 
with the transmission setup considered in this paper (i.e., spatial multiplexing) [21], we can 
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immediately conclude that the SNR exponent of Pd'^d,ML(p) equals N for any non-zero b (see 



also the lower bound ([22]) on Pd'^d,ML(/o) having an SNR exponent of as well). 



For SD-/°° we can follow a similar approach. Starting with ( [T0| ), we get 



Pd'^d,oo(p) <P 

= p 



Izfd) 



R 




b + n 



< llnl 



(18) 



Note that for SD-l°°, unlike for SD-Z^, the event ||z(d)||oo = ||z(d')||oo can, in general, occur 
with non-zero probability. Declaring an error in this case certainly yields an upper bound on 
Pd'^d,oo(p)- Next, we apply the upper and lower bounds in ([12]) and exploit the invariance of 
the /^-norm to unitary transformations to get 

1 



Pd'^d,oo(/0) <P 
= P 



R 




b + n 



< n 



N 



iHb + wlU < llwlU 



(19) 



Finally, applying the inverse triangle inequality according to ||Hb + w||2 > |||Hb||2 — ||w||2|, 
we have 



Pd'^d,oo(p) <P 



Wo > 



1 



iHbl 



(20) 



Note the structural similarity of ( [20] ) and ( [T5| ). Employing the same arguments as in the SD-/^ 
case, we get 

^ |Hb||2 



W 9 > 



N+1 



H 



< $^2 (s) e '^{Twp 



for s E [0, 1/2). Averaging over H then results in 

Pd'^d,oo(p) <2^fl+p 



|b||i 



-TV 



UBoo(p) 



(21) 



2{VN+1) Mj 

where we used the fact that s = 1/4 minimizes the upper bound for high SNR. As in the SD-P 
case for Pd'^d,ML(p), we can immediately conclude that the SNR exponent of Pd'^d,oo(p) equals 
for any non-zero b. There is, however, an SNR gap between Pd'^d,cxD(p) and Pd'^d,ML(p)5 
which can be quantified as follows. We start by evaluating [23, Eq. (20)] for the case at hand 
to get 

Pd'^d,ML(p)>2^f ^Jfl + P^j =LBml(p). (22) 
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The asymptotic SNR gap between UBoo(p) and LBmlIp), denoted as [3, i.e., UBoo(p) ~ 
LBml(p//3), p — oo, is directly obtained as 



/? = 4(v^+l)' 



1 (2N 
2\N 



(23) 



We can thus conclude that the asymptotic SNR gap between the PEP for SD-/°° and the PEP 
for SD-/^ is upper-bounded by [3, or, equivalently, we have 



Pd'^d,oo(p) ^ Pd'^d,ML(p//?), P ^ OO. 



(24) 



2) Total Error Probability: In the following, we consider the total error probability P£(p) = 
P[d' 7^ d] assuming equally likely transmitted data vectors d'. If not specified, P£-(p) stands for 
the total error probability P£:^(p) of SD-/°° and P^-^lIp) of SD-/^. We start by noting that 



(25) 



Here, P£:|d'(p) refers to the total error probability conditioned on d' being transmitted, which 
can be bounded as 

Pd'^anyd(p) < Pf|d'(p) < Pd'^d(p). (26) 

d^d' 



It follows that 



and 



d' dT^d' 

P.(p)>|^|-'^5^Pd'^anyd(p). 



(27) 



(28) 



As the SNR exponent of Pd'^d,oo(p) equals for all b = d — d' 7^ (cf. pT])), we can conclude 
that SD-/°° achieves full diversity order N and hence the same diversity order as ML detection. 



The corresponding asymptotic SNR gap is obtained as follows. With ([24|)-([27]), we get 

p^..(p) ^ \A'''Y.Y. Pd'^d,oo(p), p ^ 00 

d' d^^d' 

^ Yl Yl Pd'-d,ML(p//9), P ^ 00 

d' d^d' 

^i^r''EEp^MUd'(p//?), p-00 

d' d^d' 



(29) 
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From pT] ) together with ( [T7] ) and ([28]) together with ( [22] ), we can conclude that Ps^hip) has SNR 
exponent and can be written as ^euhip) — (-^ml p)~^ + o{p~^), p ^ oo, with some constant 
Kml > that does not depend on p. With P^^(p) ^ I^I^^Pf^^lp/Z^) from ^ and N > M, this 
yields P£-^(p) ^ P£-^j^(p/(|^|/3)), which establishes that the asymptotic SNR gap incurred by 



SD-l°° is upper-bounded by \A\P with P specified in ([23]). Furthermore, using (™) > (y)', we 
have (^^) > 2^, which, when employed in ([23]), shows that < 4 {Vn+iY < 16N. Thus, the 
asymptotic SNR gap between the total error probabilities P^-^lIp) and P^:^ (p) is upper-bounded 
by 16|^|A^. We can therefore conclude that the asymptotic SNR gap incurred by SD-/°° scales 
at most linearly in the number of receive antennas. Simulation results (see Section [VI-A ) reveal 
that the actual SNR gap is much smaller than 16|^|A^. We finally note that applying [24, Prop. 
1] shows that the statement on SD-l°° achieving full diversity order for the i.i.d. Rayleigh fading 
case directly extends to more general fading statistics such as spatially correlated Rayleigh or 
Ricean fading. 



3) l°°-Norm Decoding on Full Channel Matrix: As pointed out in Section I-F2 SD-/°° does 



not correspond to /°°-norm decoding on the full channel matrix H according to ( [TT] ). However, 
as the PEP of /°°-norm decoding on the full channel matrix satisfies 



-Pd'^d,oo,H(p) = P 



|Hb + 

oo ^ W oo 



we can apply the upper and lower bounds in ([12]) to arrive at 

1 



Pd'^d,oo,H(p) < P 



Hb + w 2 < w 



which is exactly the same upper bound as that obtained for SD-/°° in ([19]). We can therefore 
conclude that /°°-norm decoding on the full channel matrix H also achieves full diversity order 
with an asymptotic SNR gap, vis-a-vis SD-/^, that increases at most linearly in the number of 
receive antennas. 



III. Complexity of SD-/°° 

In this section, we analyze the complexity of SD-/°° by deriving an analytic expression for 
the average number of nodes visited in the tree search when pruning according to the PBC ([9]) 
is performed. A node is visited if and only if its corresponding PBC ([9]) is satisfied. We 
consider a fixed choice of Coo and average w.r.t. channel, noise, and transmit signal. Based on 
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the analytic complexity expression for SD-/°°, it is then shown that the complexity of SD-l 
scales exponentially in the number of transmit antennas M. 



A. Basic Approach 

Our methodology is similar to that adopted in [9] for SD-P. The key difference to the approach 



in [9] lies in the computation of the partial metric cdfs as detailed in Section III-B 

For a given Coo, a simple counting argument yields the number of nodes S^o^k visited at tree 
level /c, = 1, . . . , M, as 



(30) 



where 



/(zfc(dfc)) 



1, if ||zfe(dfc)||oo < Coo 
0, otherwise. 



We trivially have Soo^k < l^l*^- First, we note that E{/(zfc(d,fc))} = P[||zfc(dfc)||oo < Coo], where 
the expectation is w.r.t. the channel R, noise n, and data vector d'. Consequently, we have 



E{Soo,k} = Y,^[\\Zk{dk)\\oo<Coo] 



(31) 



with the total complexity 



M 



E{Soo} = J2E{Soo,k}. 



(32) 



k=l 



Next, we condition on the data subvector d'^ G and write P[||zfc(dfc)||oo < C*oo | d'^] 

P[\Mhk)\\oo<Coo] with 





bfc + 




. . 







Zfc(bfc) 



where = d'^ — d^ is a pairwise error subvector, 



(33) 



[UM-k+l ■ ■ ■ riM) 



and rii 



T 



[um+i ■ ■ ■ unY. We set z(b) = ZA/(bA/) and note that Zfe(bfc) = ([z(b)]M-fc+i ■ ■ • [z(b)]Ar) 
Formally, for a given d'^, we will often speak of "a node" b^, which, in a one-to-one fashion, 
refers to the node d^ = d'^ — b^ in the search tree. For example, the node d^ = d'^ corresponding 
to the transmitted data subvector d';. is equivalent to node b^ = 0. If we speak of a node b^ 



DRAFT 



November 18, 2008 



D. SEETHALER AND H. BOLCSKEI: INFINITY-NORM SPHERE-DECODING 



15 



without specifying d'^, the corresponding statements hold for all pairs d'f^,dk E A'' satisfying 
bfc = d'^ — dfc. It follows that ( [3T| ) can be written as 

E{^oo,4 = ^EEPO|z^(dfc)l|oo < Coo Id',] 



1^1 



i^^P[||z.(b,)|U<Coo] 



(34) 



where we assumed equally likely transmitted data subvectors d', for all tree levels k = 1, . . . , M; 
this holds, e.g., for statistically independent (across the transmit antennas) and equally likely data 
symbols. The sum in ( [34] ) is taken over all possible combinations of pairwise error subvectors 
bfc. 

Equivalently, the complexity at the A;th tree level E,{S2,k} for SD-P is given by ([34]) with the 
Z°°-norm replaced by the /^-norm and Coo replaced by C2, i.e.. 



E{52,J = ^5^P[||z,.(b,)|b<C2]. 



(35) 



We finally note that P[||zfc(bfe)||oo < Coo] in ([34]) and P[||zfc(bfc)||2 < C2] in (|35]) express the 
probability that node b^ is visited by SD-/°° and SD-/^, respectively, and are equivalent to the 
average (w.r.t. the channel R and noise n) number of visits of node b^ by SD-l°° and SD-P, 
respectively. 



B. Computation of the Partial Metric Cdfs 



From ( [34] ) and ( [35] ) we can see that the computation of Ej^oo./fc} and E{S'2,fc} requires 
knowledge of the cdfs of the partial metrics ||z,fc(bfc)||oo and ||zfc(b,t)||2, respectively (recall that 
the radii Coo and C2 are assumed to be fixed and independent of the channel, noise, and data 
realizations). An analytic expression for P[||zfc(bA;)||2 < C2] was provided in [9], [11]. More 
specifically, it is shown in [9, Lemma 1] that thanks to the invariance of the /^-norm w.r.t. unitary 
transformations 



|zfc(b 



k 2 



Hfebfc + 



Hl J 



where the {k + L) x k matrix H^. with L = iV - M has i.i.d. CA/'(0, 1/M) entries. Conditioned 



on hk, the RV 



Hfcbfc + 



rifc 



is then easily found to be Xfc+L-distributed, which leads to an 



expression for P[||zfc(bfc)||2 < C2] in terms of the lower incomplete Gamma function (see also 
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Section [riI-C|). As the /°°-norm is not invariant w.r.t. unitary transformations, this approach does 



not carry over to the /°°-case considered here. Instead, we follow a direct approach as detailed 
below. 

1) Cdfof ||zfc(bfc)||oo-' Since the nonzero entries in R are statistically independent [25, Lemma 
2.1], the elements of Zfc(bfc) conditioned on are statistically independent as well. We thus 
have 



k+L 



p[||zA.(bfc)||oo < c^oo] = Hp <c. 



i=l 



(36) 



The bottom L elements of Zfc(bfc) are given by the i.i.d. C7V(0, cr^) vector (see ([33])) so that 



[zfc(bfc)]i| < Co 



71 



i = k + 1, 



k + L, 



which yields 



P[||z^-(bfc)||oo<Coo] 



71 



L k 



JJP |[z(b)]M-™+i| <a 



-' m=l 



(37) 



can be 



2) Cdf of I [z(b)]A/_m+i|.- An analytic expression for P | [z(b)]A/_m+i| < Co 
obtained via direct integration using the fact that the nonzero entries of R are statistically 
independent with V2MRi^i ~ X2(N~i+i) and R,j ~ CAf{0, 1/M), for i = 1, . . . , M, j > i [25, 



Lemma 2.1]. In Appendix |a| it is shown that P |[z(b)]A^ 



jM-m+i| < Coo is a binomial mixture of 
X-distributions with degrees of freedom reaching from 2 up to 2(m + L). More specifically, for 
m = 1, . . . , M, we have 



<Co 



m+L-1 
1=0 



C. 



2 

oo 



|b„||2/M + a2 

with the coefficients Bi{hm) given by the binomial probabilities 
with parameter 



(38) 



m+L — 1 
I 



(p(bJ)Hl-j9(b„))"+^-i-' 

hm-l\\l + M(T^ 



b„||2 + Ma2 



(39) 



(40) 
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and ||bo||2 = 0. In [13] the pdf of the RV | [z(b)]A/-m+i | associated with the distribution ( |38| ) 
was obtained in a different form (i.e., not in terms of a binomial mixture of ^-distributions) 
using an alternative derivation. More specifically, the derivation in [13] exploits the property 

I 2 I 2 

||zm(b„)||^ = ||z^_i(b„_i)||^ + |[z(b)]M-m+i| with ||z„_i(b„_i)||2 and | [z(b)]M-m+i | being 
statistically independent and the MGFs of ||zm(bm)||2 and ||zm_i(bm-i)||2 being known from 

I 1 2 

[9]. This allows to compute the MGF of | [z(b)]Af_m+i| and, via the inverse Fourier transform, 
the corresponding pdf, which can then be used to establish ( [38] ). Finally, we note that the direct 
integration approach used in this paper to obtain ( [38] ) can, in contrast to the approach employed 
in [13], be applied to derive the distributions of | [z(b)]R and |[z(b)]i_A/-m+i|, which are 
needed to compute (bounds on) the complexity of SD-/°° (see Section V-B for more details). 

3) Sum Representation and Moment Generating Function: The binomial mixture representa- 
tion ( [38] ) allows for an interesting alternative representation of the RV | [z(b)]M-m+i | as the 
sum of independent RVs. In particular, using results from [26], it is shown in Appendix |B] that 

\[z{h)]M-m+lf =tl (41) 



where 

|2//\^ I ^2 



tm = ^ 7 + 2^ A 



m+L-1 

(42) 



2 

\ 1=1 

with the independent RVs 7^ and A^, z = 1, . . . , m + L — 1. Here, 7^ ~ xl with pdf f^^{x) and 
the have the mixture pdf 

h2{x) = {l-p{hJ)f^2{x)+p{hJ6{x) (43) 

or, equivalently, with probability p{hm) the A^ come from a population having pdf S{x) (i.e., they 
are zero with probability p{hm)) and with probability 1 — p{hm) they come from a population 
having a X2 distribution. 



Besides being interesting in its own right, the representation (42) allows to compute the MGF 



2 

of I [z(b)]M-m+i| or, equivalently, of in a straightforward manner, by using ( |40l ) and ( [43] ), 
as (cf. ^) 



\h^\\l/M + a^)s 



|b„||2/M + a2)s 



[l-(||b^_,||i/M + a^).]-+^-^ 
[l-(||b„||i/M + a2),]-+^ • ^ ^ 
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C. Cdf of \\2k{hk)\\2 



Using the results of Section III-B we can directly recover the cdf of ||zfc(bfc)||2 obtained in 
[9]. The derivation in [9] is explicitly based on the rotational invariance of the /^-norm (see [9, 



Lemma 1]). Here, we follow an alternative approach and start by using ( [33] ) to obtain 



|Zfc(bfe)||2 



|n,||^ + ^|[z(b)] 



M-m+l 



m=l 



Thanks to (41 1 we then have 



W{h,)\\l^\\n,\\l + Y,tl 



m=l 



Since the RVs ||nL||2 and t^, m = 1, . . . ,k, are mutually statistically independent, the MGF of 
||zfc(bfc)||2 can be written as 



'^'iiz,(b,)iii(s)=$ii„,iii(s)nM^) 



m=l 



where $||„^j|2(s) denotes the MGF of ||n/^||2 given by (cf. (|2])) 



(45) 



$llv.,l|2fs 



(46) 



Inserting (44) and (46) into (45), then yields 



*l|zfe(bfe)||i(s) 



m=l 
1 



[l-i\\h ra.,\\l/M + a')s 
\h^\\l/M + a^)s 



2^ „lm+L-l 



2\ „lm+L 



[l~i\\h,\\l/M + cr^)s] 



k+L 



which is the MGF of a X2(fe+L) -distributed RV. Consequently, the cdf of ||zfc(bfc)||2 is given by 



P[||Zfc(bfe)||2<C2] =7fc+L 



CI 



\hk\\l/M + a^ 



(47) 



which is what was found in [9], [11]. 
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D. Final Complexity Expressions 

We are now ready to assemble our results to get the final complexity expressions for SD-/'^ 
and SD-/^. Inserting ( [38] ) into ([37]) and using ([34]), we obtain 

"'2 \ ^ m+L—l 



1 



1^1' 



71 



En E 

bft m=l 1=0 
Bl{hm) Im+L-l I 



ri2 



1 1 2 



/M + 



(48) 



The corresponding total complexity follows from ([32]). In comparison, for SD-Z^, using ( [35] ) and 
(|47]) yields [9], [11] 

2 \ (49) 



The total complexity for SD-l^ is then obtained as 



M 



E{52} = 5^E{52,fc}. 



(50) 



A;=l 



E. Choice of Radii 

For a meaningful comparison of the complexity of SD-l°° and SD-P, the radii Coo and C2 
have to be chosen carefully. In our analysis below, we use the approach proposed in [4], [9], 
[17] for SD-/^, where the choice of C2 is based on the noise statistics such that the probability 
of finding the transmitted data vector inside the search hypersphere is sufficiently high. Recall 
that our complexity analysis assumes a fixed choice of the radii that does not depend on the 
channel, noise, and data realizations. We start by noting that ||z(d')||2 = ||n||2, which is X2N- 
distributed. Choosing the radius C2 such that the transmitted data vector d' is found inside the 
search hypersphere with probability 1 — e G [0,1] is accomplished by setting 



P[l|nl|2 < C2] =Jn(^ 



CI 
a2 



1 -e. 



Solving ([51]) for yields 



(51) 



(52) 



For the SD-l°° case, we adopt an analogous approach arguing that we choose Coo such that 
d' is contained in the search hypercube with sufficiently high probability. Specifically, for the 
complexity comparisons in the remainder of the paper, we choose the radius Coo such that the 
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probability of finding the transmitted data vector d' through SD-Z°° equals that for SD-/^. This 
is accomplished by setting 



P[||n||oo <Coo] 



71 



2 \ ^^ 



1-e 



which results in 



(53) 



(54) 



For any e > 0, for both SD-Z^ and SD-Z°°, there is a nonzero probability that no leaf node is found 
by the detector, i.e., ||z(d)||2 > C2 or ||z(d)||oo > C'oo, respectively, for all d G A^^ . Stopping 
the detection procedure and declaring an error in this case, it follows that the corresponding 
SD does not implement exact SD-/^ (i.e., ML) or exact SD-/°° detection, respectively. To obtain 
exact ML or SD-/°° performance, the corresponding SD algorithm has to be restarted using a 
schedule of increasing radii (or equivalently a schedule of decreasing values for e) until a leaf 
node is found within the search hypersphere or hypercube, respectively (see, e.g., [9], [27] for 
SD-Z2). 

Let us investigate the case of SD without restarting as described above and denote any of 
the corresponding SD algorithms, i.e., SD-/^ or SD-/°°, based on a fixed radius C (according to 
either (|52]) for SD-/^ or ([54]) for SD-/°° with a fixed e) as SD-NoR. Denote the corresponding 



error probability as P£,sd-nor(p) = P 



d ^ d' 



which 



d 7^ d'; C in contrast to P£:,sd(p) = P 
denotes the error probability of exact SD-/°° or SD-Z^, implemented, e.g., through restarting 
using a schedule of increasing radii as explained above. In the following, we show that the 
consequence of not restarting the SD is an error floor of e. In the remainder of this section, || ■ || 
stands for either the Z°°- or /^-norm. By the law of total probability and recalling ( [53] ) and ( [5T| ), 
we have 



Pf ,sd-nor(p) = P d^d';C|||n||<C 



1-e) 



d 7^ d';C| ||n|| > C 



(55) 



and, similarly. 



P£,sd(/o) = P d ^ d'l ||n|| < C 



'l-e)+P 



d ^ d'l ||n|| > C 



(56) 



In the case ||n|| < C, SD-NoR and exact SD yield identical results since at least one 
data vector (namely the transmitted data vector d') is found inside the search space with 
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radius C. Hence, P 



d ^ d'l ||n|| < C 



d ^ d';C| ||n|| < C 
1 - e) < P^,sd(p) by ([56]), yields 



d ^ d'l ||n|| < C 



d ^ d';C| ||n|| < C 



:i-e)< P, 



(P) 



, which, together with 



(57) 



for the first term on the RHS of (55 1. In the case 11 nil > C, the transmitted data vector d' is not 



found inside the search space with radius C. Thus, for SD-NoR, this case will certainly result 



in an error, i.e. 



d7^d';C| ||n|| >C 



1 



(58) 



which together with ( [57] ) yields the following upper bound on the error probability of SD-NoR: 

P£-,sd-nor(p) < P£-,sd(p) + e. 



Using ( [58] ) in ( [55] ) immediately yields the lower bound 

P^.sd-norIp) > e. 



Since limp^oo Pf,sD(p) = (see Section II-B2), we have limp^oo P£-,sd-nor(p) = e- We can finally 
conclude that if the system operates at a target error rate that is much higher than this error 
floor, a fixed radius and the absence of restarting will have a negligible impact on the total error 
probability. 



F. Asymptotic Complexity Analysis 

In [14] it is shown that the complexity of SD-P scales exponentially in the number of transmit 
antennas M. Motivated by this result, we will next show that the complexity scaling behavior of 
SD-l°° is also exponential in M. For simplicity of exposition, we set M = in the following. 

1 ) Impact of Choice of Radius: The asymptotic complexity scaling behavior of SD-P is 
studied in detail in [14], where it is shown that ]E{S'2} > e"'^'^ for large M and some 7 > 0. This 
result is derived under the assumption of C| increasing (at least) linearly in M, which guarantees 
a nonvanishing probability of finding at least one leaf node inside the search hypersphere [28, 
Theorem 1]. It is furthermore shown in [28] that the exponential complexity scaling behavior of 
SD-/^ extends to the case where the sphere radius is chosen optimally, i.e., when the radius is 
set to the minimum value still guaranteeing that at least one leaf node is found (this would, of 
course, correspond to a genie-aided choice of the sphere radius since it essentially necessitates the 
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knowledge of the ML detection result). We finally note that C| chosen according to ( [521 ) results in 
hnear scaling in M for large M. For a proof of this statement the reader is referred to Appendix 
[pj Linear scaling of C| in M is also obtained, for example, by setting C| ocE{||n||2} = cr^M 
as was done in [9], [14]. 

For SD-/°° it is shown in Appendix [d] that the radius according to ( [541 ) scales 
logarithmically in M for large M. We will next show that this is also the case if is chosen to 
be proportional to E{ ||n||^}. Consider the M i.i.d. "distributed RVs 2?/j ~ ^ = 1; • • • > ^■ 
Then, [29, Eq. (2.5.5)] 

^ 1 

max{?/i, 1/2, . . . , Vm} = Y] -Zi 

i=\ 



% 



where the RVs Zi, % = 1,...,M, are also i.i.d. with 2zi ~ xi- Since we have ||n| 
(T^ max{?/i, ?/2, • • • , 2/m} and Ej^;,} = 1, we obtain 



2 

oo 



E{||n||L}=a^i^, 



M 



with i^M = ^i=i 1/^ denoting the Mth harmonic number. For large M and with (3 ^ 0.5772 
denoting the Euler-Mascheroni constant, we have Hm = P + ln(M) + C(M~^), M — > cx) [30], 
which establishes the result. At first sight, the logarithmic scaling of in M versus the linear 
scaling of C| suggests a difference in the asymptotic complexity behavior of SD-/°° and SD- 



While the complexity and pruning (see Section IV) behavior for finite M are indeed quite 
different in general, we will, however, next show that SD-l°° also exhibits exponential complexity 
scaling in M. 

2) Lower Bound on Complexity: Computing the asymptotics of the exact SD-/°° complexity 



expression ((|48l) together with ([321)) seems involved. We therefore tackle the problem by com- 
puting a lower bound on complexity and by showing that this lower bound scales exponentially 
in the problem size M. Our technique can readily be extended to the SD-P case resulting in an 
alternative, w.r.t. [14], proof of the exponential complexity scaling behavior of SD-/^. We note, 
however, that while our proof seems to be shorter and more direct, the result in [14] is more 
general in the sense that it applies to MIMO channels with very general fading statistics. Our 
approach, in contrast, explicitly hinges on the channel matrix H being i.i.d. Rayleigh fading. On 
a conceptual basis, our proof is more closely related to the approach in [13], where bounds on 
the complexity of SD-/^ (and variants thereof) are studied. 
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We start by focusing on the expression for P[||zfc(bfc)||oo < Coo] , k = 1, . . . , M, obtained by 



inserting ([38]) into the RHS of (37). Considering only the summand with index / = m — 1 in 



( |38| ), we obtain (recall that L = N-M = 0) 

k 



p[iMb.)iu<cj>n(^tg^f 

m=l \ ^ 



m— 1 



Using ( |134| ) in Appendix |C] according to 



71 



■ n2 

> 7l 



7i 



Cl 



|b„||2/M + a2 



y ||b„||2/M + a2 



we get 



P[||Zfc(bfc)||oo<Coo] > 



7i 



7i 



n2 
a2 



n 



m=l 



1 + 



(||b„||2/M + a2)™ 



Ma2 



Furthermore, by ( |128[ ) in Appendix |C] we have 7a(x) < 1, x > 0, so that 



> 



'^2 



A/ 



(59) 



(60) 



(61) 



Inserting the specific choice of according to (54) into the RHS of (61), we obtain 



7i 



Cl 



a" 



> 1 -e 



and hence ( 60 ) becomes 



PO|z-fc(bfc)||oo<Coo] >(l-e)(^l 



IK l|2\-'= 



With (34) and (32) we then obtain 



M , 

E{^oo}>(i-e)j:^$:(^i+ 



IK l|2\-fc 
\^k\\2 



which can be further simplified using 



|b,||2<52e(bfe) 



(62) 



(63) 



where 5^ = max |c?' — (iP is the maximum Euclidean distance in the scalar symbol constellation 
and ^(bfc) = .^(d'^jdfc) denotes the Hamming distance between and d'^, i.e., the number of 
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non-zero entries (symbol errors) in = d'^, — d^. Note that every data vector d'^ induces the 
same set of Hamming distances {^(d'^^, dk), d^ G A''}. From ( [63] ), we therefore get 



M 



k=l dfc 
M k 



'I 



M(t2 ) 



for any d'^ 



fc=l j=0 

where, in the last step, all terms having the same Hamming distance ^(d'^,,dfc) = i have been 
merged. Here, Wi = (^)(|^| — 1)' > (^) denotes the number of data vectors d^ G A'' that have 
Hamming distance i from d'^. Furthermore, with (^) > i = 1, . . . , fc, we get Wi > (|)\ 
so that 



M k 



1 + 



Ma' 



(64) 



nsoo] > {I - e)Y,Y. 

k=l i=l 

where the i = (Wq = 1) term is omitted for all k. 

Lower Bound on Complexity for SD-P: As already mentioned, the technique used to derive 
( |64l ) can readily be extended to the SD-/^ case. We start by considering P[||zfc(bfc)||2 < C2], 
k = 1, . . . , M, given in ( |47| ) and applying the lower bound ( |134[ ) in Appendix |C| to obtain (recall 
that L = - M = 0) 



P[||Zfc(bfe)||2<C^2] =lk 



CI 



|b,i|i/M + a2 
Employing ( |127[ ) in Appendix |C] and using ( [52] ), we get 



>7M 



(CI 



1 



|bfc| 



Ma2 



7fe 



^2 



> iM 



ci 



a 



which yields 



P[||zfc(bfe)||2<C^2] >(l-e) 1 + 



|bfc| 



and finally, by (49) and (50), results in 



M . 

E{52}>(l-e)5:p^|:(^l 



|Dfc||2 



Ma2 



(65) 



The RHS of ( [65] ) is precisely the lower bound ( [63] ) on ElSoo}- Consequently, the simplified 



lower bound ([64]) on Ej^oo} is also a lower bound on E{S'2}. 
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3) Asymptotic Analysis of Lower Bound: In the following, we show that the lower bound ( [64) ) 
exhibits exponential scaling in the system size M = N, which, together with the trivial upper 
bound El^oo} < 1^1^"'^^^, establishes exponential complexity scaling of SD-/°° (and of SD-P 
together with ^{82} < |^|*^+^). 

We start by noting that a trivial lower bound on EjS'oo} is obtained by considering only one 



term in the RHS of ([64]), resulting in 



E{Soo}>(l-e)(^y fl + ^l =/(M). (66) 



Evidently, establishing that 



1„„ > (67) 



is sufficient to prove that SD-/°° (and SD-P) exhibits exponential complexity scaling. To this 
end, we set k = \aM] and i = \l3M] with a G ]0, 1] and /3 G ]0, a]. We then have 

log /(M) _ log(l - 6) \JM] f\aMr\ _ \aM] f [/jM] \ 



M M M 



Furthermore, writing \aM~\ = aM + Aq, and [/3M] = /5M + A^ for some values Aq, and A^j 
satisfying < Aq, A/3 < 1 gives 

log/(M) ">^(l-^)^^r,^^^/M)lo,^-^^JM 



M M ' ' PI ^ '^\p + A^/M 



(a + AJM) log ( 1 + ^ + ^Ap/M 



which results in 



lim = /31og(a//3) - alog(l + P/a^) = 7(a,/3). (68) 

Indeed, for any SNR (i.e., for any cr^), there exist values of a and (3 for which 7(0;, (3) > 0. For 
example, with (3 = a/2, any a satisfying 

< a < min|-^2(T2(V2- 1), l| (69) 

results in 7(0;, /5) > 0, which establishes the desired result. 
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IV. Tree Pruning Behavior 
In the previous section, we showed that both SD-/°° and SD-P exhibit exponential complexity 



scaling in M. The analytic results for Ej^oo} and E{S'2} in Section III-D indicate, however, that 
the finite-M complexity can be very different for SD-/°° and SD-P. While it seems difficult to 
draw general conclusions based on the analytic expressions for Ej^oo} and E{5'2}, interesting 
insights on the difference in the corresponding tree pruning behavior (TPB) can be obtained. 
Here, we predominantly focus on the average (w.r.t. channel, data, and noise) TPB; some 
comments on the instantaneous (i.e., for a given channel, data, and noise realization) TPB 
will be made at the end of this section. Our analytic results will be corroborated by numerical 
results in Section IVI-CI 

A. Average TPB 

The average TPB of SD-/°° and SD-/^ will be studied through a high-SNR analysis of the 
probabilities P[||zfc(bfc)||oo < C^o] (see (|37]) with (|38])) and P[||zfc(bfc)||2 < C2] (|47]) of a certain 



node bfe being visited by SD-/°° and SD-/^, respectively. Equivalently, 1 — P[||zfc(bfc)||oo < C*oo] 
and 1 — P[||zfc(b,t) II2 < C2] refer to the probabilities of node being pruned by SD-l°° and 



SD-l , respectively. While (47) shows that the probability of a node b^ being visited by SD-l 



2 



depends only on ||bjt||2, i.e., on the Euclidean distance between and d'^, in the SD-/°° case 
this dependence on seems in general rather involved. However, the high-SNR analysis of 
P[||zfc(bfe)||oo < Coo] reveals simple characteristics of b^ that determine the probability of node 
bfc being visited by SD-l°°, which then enables us to characterize the fundamental differences in 
the TPB of SD-/°° and SD-/^. The corresponding results will be supported by simple geometric 



considerations. Throughout this section, the radii Coo and C2 are chosen according to and 
( |52l ), respectively, and we define 

«:oo = ^ = -log(l - VT^e) (70) 

«2 = ^ = 77V (1 - e) • 
1 ) High-SNR Analysis: Consider a node[^ b;,. 7^ at tree level k and denote the index of the 
corresponding first tree level exhibiting a symbol error by m(bfc) G [1, 2, ... , k]. More precisely, 

' For bfc — the high-SNR behavior of P[|lzfc(bfc)|]oo < Coo] and P[||zfe(bfc)|]2 < C2] is trivial since the expressions 
P[l|zfc(0)||oo < Coo] = [7i(koc)]''+^ and P[||zfc(0)||2 < C2] = 7fe+i(«:2) do not depend on the SNR p. 
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we have [hk]k-i+i = 0, for i = 1, . . . ,m(bfc) - 1 and [bfe]fc_a(bfc)+i 7^ 0. In Appendix |e| it is 
shown that 



P[i|zfc(bfc)|U < Coo] ~ A(m(b,))«:^+^(p||bfc||2/M) 



-(k+L) 



p — s> OO 



(71) 



where 



A{m{hk)) = bil^oo)] 



m(bfe)-(-L-l ,^ 
m(bfc)+L-l ^ lm[t)k)+L—i 

I y fm(bfc)+L-/)! 



i=0 



(72) 



Note that A(m(bfc)) does not depend on the SNR p. Furthermore, using ( |125[ ) in ( [47] ), we 
directly obtain a corresponding result for SD-/^ as 

1 



P[K(bfc)||2<C2] ~ 



ik + L)\ 



(Pl|bfc||^/M) 



-{k+L) 



p OO. 



(73) 



From ( [71] ) and ( [73] ) we can infer that the only characteristics of b^, which determine the high- 
SNR probability of node bj. being visited, are ||bfc||2 in the case of SD-l^ and ||bfc||| and m{hk) 
in the case of SD-l°°. Moreover, for SD-/°° the dependence on m(bfc) is through the function 
A{m{hk)) ( [72[ ), which has the following properties. By inspection we get 

1 



lim 74(m(bfc)) 



(m(bfc) + L)! 



(74) 



and, as shown in Appendix F-A 



lim A(m(bfc)) = 1. 



Koo— »0 



(75) 



Note that k^o oo for e ^ and k^o ^ for e ^ 1. In Appendix F-B it is furthermore shown 
that A[m(hk)) is a nonincreasing function of k^c, which, together with ( |74| ) and ( [75] ), yields 

1 



(m(bfc) + L)! 



< A{m{hk)) < 1. 



(76) 



The lower bound in ( [76| ) allows us to conclude that, in the best case, the high-SNR probability 
of SD-/°° visiting node b^ decreases as l/((m(bfc) + L)\) for increasing m(hk). This suggests 
that nodes corresponding to a first symbol error at high tree levels, i.e, nodes with large m(bfc), 
are in general pruned with higher probability than those corresponding to a first symbol error 
at low tree levels, i.e., nodes with small m(bfc) (provided, of course, that ||bfc||2 is constant in 
this comparison). 
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2) Average TPB Comparison: Let us next compare the high-SNR TPB of SD-/°° to that of 
SD-P. We start by defining 

PC = (77) 



For bfc 7^ 0, the resuks in fTT] ) and f73| ) imply 

P[||zA:(bfc)||oo < Coo] < P0|Zfc(bfc)||2 < C2] , p ^ OO (78) 

if 

^(^(b.))<^^^P^+^ (79) 

and vice-versa, i.e., 

P[||zA:(bfc)||oo < Coo] ^ P0|Zfc(bfc)||2 < C2] , p ^ OO (80) 

if 

A{jh{h^)) > jj^^ p^^\ (81) 

Hence, the high-SNR average pruning probability of a node b^ 7^ for SD-/°° as compared to 
SD-Z^ is entirely described by the two functions 74(m(bfc)) and + L)! p^"*"^, = 1, . . . , M. 



Since y4(m(bfc)) < 1, the condition in ( |79l ) is certainly satisfied for a// nodes b^ 7^ and tree 
levels k = 1, . . . ,k, with k being the largest integer satisfying 

'V{k + Ly. < PC. (82) 



fe+i 



We set /c = if no integer satisfies ([82]). Using ( [78] ) in the expressions for EjSoo.fc} ([34]) and 
E{52,fe} ([35]) for the terms with b^ ^ ancQ p[||zfc(bfc)||oo < Coo] < P[||zfc(bfc) II2 < C2] for 
the terms with = 0, we can now infer that 

E{5oo,fc} ^ E{S2,k}, P ^ 00 (83) 

for A; = 1, . . . , fc, or equivalently, in the high-SNR regime, the average number of nodes visited 
by SD-/°° up to tree level k (corresponding to tree levels closer to the root) is smaller than that 
for SD-P. Furthermore, if A; = M 

E{Soo} ^ E{S2}, p ^ 00 (84) 

^ As already noted in footnote 2, we have P[||zfc(0)||oo < Coo] = [7i(«:oo)]''^^ and P[||zfc(0)||2 < C2] = lk+L(j^2), where 
[7i(«:oo)]^ = 7jv(k2) = 1 - e due to ^ and l|5T|. It follows that the condition P[||zfc(0)||oo < Coo] < P[||zfc(0)||2 < C2] 
is equivalent to [71(^00)]'°^^ < 7fc+L(K2). Furthermore, using 7i(koo) = (1 — e)^^'^ — [7jv(k2)]^'''^, this condition can be 
written as [7]v(k2)]^/^ < [7fc+L(K2)]^'^'''+^', which according to ( fT29t holds for all fc = 1, . . . , M. 
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since (1831) then holds for all tree levels k = 1 M. We will next show that for the radii 



chosen according to ([54]) and ( [52| ), we can, indeed, have A; = M. In the following, we write 
pc(e) to emphasize the dependence of the radii ratio on the parameter e G [0, 1]. In Appendix 
Glit is shown that pc(e) is a nondecreasing function of e and furthermore 



lim pc{e) = 1 and lim pc{e) = \/N\ (85) 

e— >0 e— >1 

which implies 

1 < Pc(e) < v^. (86) 



We can therefore conclude that /c is a nondecreasing function of e taking on any value in [0, M] 
(achieved by varying the parameter e) with the following two extreme cases: 



• For e ^ 1, we get k M so that ([84]) holds. This indicates that in the high-SNR regime 
SD-/°° will have a smaller total complexity than SD-P if e is sufficiently close to 1. 

• For e ^ 0, we have A; — > 1 for L = and A; ^ for L > 0. Equivalently, if e is sufficiently 



close to 0, ( [83] ) holds for the first tree level if L = and holds for none of the tree levels 
if L > 0. In particular, for e — 0, we have pc(e) 1 and hence Coo ~ C2, e — > 0. This 
implies that the hypercube of radius Coo contains the hypersphere of radius C2 and the total 
complexity of SD-/°° will trivially be higher than that of SD-P. In general, SD-/°° will have 
a higher total complexity than SD-P if e is small. 
In summary, varying the parameter e has a significant impact on the total complexity of SD-/°° 
relative to that of SD-P. In particular, the total complexity of SD-l°° can be higher or lower 
than that of SD-P. 

Let us next study the average TPB of SD-/°° as compared to SD-l^ for the tree levels k = 
k + 1, . . . , M . Here, we have l/{{k + Ly.)p^^ < 1 so that condition ( fTO] ) will not necessarily 
be satisfied for all nodes b^. 7^ 0. This means that for tree levels A; > A; + 1, in the high-SNR 
regime, SD-P may prune certain nodes with higher probability than SD-/°^. Since A{m{hk)) > 
l/(m(bjfc) + L)\, the condition in ( [8T] ) is certainly satisfied for all nodes b^. A; = A; + 1, . . . , M, 
with m(bfc) = 1, . . . ,Wi{k), where fn{k) is the largest integer m E [1, k] satisfying 

(^^^Ti)! > {kTiy-^''^'' ^^^^ 

A high-SNR statement dual to ([83]) based on ((87|) can, in general, not be given for tree levels 



A; > A; + 1 as ( [87] ) applies only to a certain subset of nodes at a specific tree level A; > A; + 1. 
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Some insight can, however, be gained by studying the cardinalities of these subsets at high 



tree levels. Let us denote the cardinality of the set of nodes hk satisfying ( [80| ) for some given 
transmitted data subvector d'^ (recall that = — d'^ with dfc,d'^ G A'') by Sk- From the 
previous paragraph we know that this set includes for sure all nodes with m(bfc) = 1, . . . ,m{k), 
k = k + 1, . . . , M. Hence, we have 

m(k) 

S~k> ^(1^1-1)1^1'=-'" = - |^|-™(')), k = k + l,...,M. 

m=l 

For fn(k) > 1 (which is always the case for L = and all tree levels k > k + 1) and |^| > 2, 
we see that more nodes at tree level k > k + 1 will be pruned with a higher probability by 
SD-/^ than by SD-/°°. Even more, since the RHS of ( [87] ) is a decreasing functiorj^of k for all 



k >k + 1, m{k) is a nondecreasing function of k that becomes large if k is large. In this case, 
most out of the \A\'^ nodes at tree level k will be pruned with a higher probability by SD-/^ 
than by SD-/°°. Note, as shown above, that this behavior is reversed at tree levels close to the 
root (i.e., up to tree level k), where all nodes are pruned with higher probability by SD-/°° than 
by SD-/2. 

3) Relation to Geometric Properties: The results on the average high-SNR TPB are nicely 
supported by simple geometric considerations. We now assume L = N — M = Q and argue that 
the average number of visited nodes at tree level k is roughly determined by the volume of the 
involved search space of dimension k (see, e.g., [9]). In the SD-/^ case the search spaces are 
hyperspheres, whereas in the SD-/°° case they are hypercubes. We will next see that analyzing the 
volume behavior of the hyperspheres and hypercubes associated to SD-/^ and SD-/°°, respectively, 
as a function of the dimension, or equivalently, as a function of the tree level k, recovers many 
of the insights obtained in the previous section. 

For SD-/^, the search space at tree level A; is a hypersphere of radius C2 in 2k real- valued 
dimensions with volume (e.g., [31]) 

k\ 



^ This can be proved by showing that fc + l + L > pc, ior k > k + I. Applying the definition of k in j82[ l, we get 
E+i+i^j-^ +! + -£/)! > PC, which together with \/n). < n, for n G N, establishes the desired result. 
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For SD-/°°, the search space consists of the set of all k pairs Xi^i,Xi^2 G M, i = 1, . . . , /c, that 
satisfy x'^^^ + < C^, with the corresponding volume 

Voo,k = Ti'iCi,)'. (89) 



From ( [88] ) and ([89]) it follows that Voo,k < V2^k for all tree levels k = 1, . . . ,k with k being 
the largest integer satisfying 

\^.<Pc (90) 

and vice-versa, i.e., \4o,fe > V2,fc for A; = A; + 1, ... , M. This indicates that SD-/°° prunes more 
nodes than SD-P at tree levels closer to the root, whereas this behavior is reversed at tree 
levels closer to the leaves. Even more, the threshold tree level k defined by &0h (found through 



analyzing the volume behavior of the search spaces) equals the threshold tree level ( [82| ) found 
through a high-SNR analysis of the pruning probabilities. 

B. Instantaneous TPB 

The insights and results on the average TPB found in the previous section extend, to a certain 
degree, to the instantaneous TPB (i.e., the TPB for a given channel, data, and noise realization). 
Recall that a node is pruned by SD-/°° if ||zfc(bfc)||^ > and by SD-/^ if ||zfc(bfe)||2 > C|. 



Noting that Zfc(bfc) is a length k+L vector and applying ([12]) yields 

(fc+L)||z,(bfc)||L>||z,(b,)||^. 
A node pruned by SD-/^ is therefore guaranteed to be pruned by SD-/°° as well if 

k+L-^^- 

Consequently, we have 5*00,*; < 5*2, ^ for = 1, . . . , fci with 

^i = max{[pcj -AO}. (91) 

We can therefore conclude that SD-/°° prunes (in an instantaneous sense) more nodes than SD-/^ 
at tree levels close to the root, more specifically, for all tree levels up to level ki (cf. Section 



IV- A2 for the corresponding result in terms of average TPB). We furthermore note that the radii 
ratio pc not only determines the average TPB but also the instantaneous TPB. 

Next, let us compare the instantaneous and the average high-SNR TPB results quantitatively. 



We have Soo,k < S2,k, for A; = 1, ... , ki, with ki defined in ( [91] ), while in terms of the average 
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TPB, we have EISqca,} ^ ¥,{S2.k}, p ^ cso, for k = 1, . . . , k, with k defined in ([82]). Due 
to ''+{/{k + L)\ < k + L (since, evidently, {k + L)\ < {k + L)''^^), we obtain ki < k, which 
shows that the instantaneous TPB result Soc,k < S2,k extends, in general, to fewer tree levels 
than the average TPB result EISqca,} ^ E{S'2,fc}, p oo. This, of course, makes sense since 
Soo,k < S2,k implies Ej^ocfc} ^ E,{S2,k}, P ^ oo, but not vice-versa. 



V. The Truth and the Beautiful: /°°-Norm SD 

As already mentioned, the SD-/°° VLSI implementation in [7] is actually based on the 
norm rather than the /°°-norm; the corresponding tree search is conducted using the recursive 
metric computation rule ||zfc(dfc)||55 = max{ ||zfe_i(dfc_i)||s;, || [z(d)]M-fc+i||5ci} together with 
the partial BC 

||zfe(dfc)||55 < C55 (92) 

where C55 denotes the "radius" associated with SD-l°°. Consequently, SD-l°° finds all data 
vectors d satisfying ||z(d)||55 < C55 and chooses, within this set, the vector 

dss = arg min ||z(d)||55. (93) 

We next show how the error probability (see Section |II]) and complexity (see Section [111]) results 
obtained for SD-/°° carry over to SD-/°°. Most results in this section are based on the simple 
inequalities 

ImI<MI6<ML xgC^. (94) 
A. Error Probability of SD-l^ 



a) Distance Properties: Combining (94) with (12) and following the steps in (13) yields 



||r-Hd55||2<2iV||r-HdML||2 

which shows that, compared to SD-/°°, we essentially incur at most a factor of \/2 increase in 
terms of the distance ||r — HdgsH^ realized by SD-/°°. 
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b) Diversity Order and SNR Gap: With ( |93| ), an upper bound on the PEP of SD-/°° is 
given by 

Pd'-.d,55(p) <Pf||z(d)||55< ||z(d')|k 



Next, following the steps in ([18]) - (|20]) for SD-/°°, using the bounds (|94]) and ([12]), yields 

1 



Pd'^d,55(p) < P 



W 2 > 



/2iV+l 



iHbl 



(95) 



Employing the same arguments as in the SD-l°° or in the SD-l case in Section II-Bl 



we 



can conclude that Pd'-*d,5o(p) has the same SNR exponent as Pd'^d,cxD(p) and Pd'->d,ML(p)- 



Furthermore, from ( [95] ) we obtain Pd'^d,55(p) < UB55(p), where UB55(p) is given by UBoo(p) 
in ( [2T| ) with the factor \/N replaced by \^2N. Accordingly, the asymptotic SNR gap /5 between 
UB55(p) and LBml(p), as defined in ( [22] ), i.e., \JB^{p) ~ LBml(p//3), p ^ oo, is given by f3 
in ( [23] ) with the factor replaced by V2N. This corresponds to an increase of a factor of 
roughly two in the corresponding upper bound on the SNR gap as compared to that achieved 
by SD-/°° ([23]). Finally, employing the arguments used in Section II-B2[ these statements carry 
over to the total error probability in a straightforward fashion showing that SD-l°° (like SD-/°° 
and SD-/^) achieves full diversity order N with an asymptotic SNR gap to ML detection that 
increases at most linearly in A^. 



B. Complexity of SD-l"^ 



With ([92]) and following the steps ([30]) - ([34]), we obtain the complexity ¥.{S^^k} of SD-/°° 
at tree level k as 

^ " ^ ^ (96) 



with the total complexity given by ElSgs} = J2h=i ^{S^,k}- As in the case of SD-l°°, invoking 



the fact that the elements of Zfc(bfc) (conditioned on b^) are statistically independent (cf. ([36])), 
we get 



k+L 



(97) 



4 = 1 



The real and imaginary parts of the bottom L elements of Zfc(bfc) are i.i.d. A/'(0,(j^/2) so that 

2 



[zfc(bfc)]i 



7i 



i = k + 1, . . . ,k + L, 
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which, upon insertion into ([97]), yields 



P[||Zfc(bfc)||55<C55] 



2\ O 



2L k 



Hp ||[z(b)]M-™+l||~<C5 



m=l 



(98) 



can be obtained if 



analogously to ( |37] ). An analytic expression for P || [z(b)]A/_m+i||~ < 
hM-m+i is purely real, purely imaginary, or e^wa/ to zero. For these cases the real- and imaginary 
parts of [z(b)]M-m+i are statistically independent, which gives (see Appendix [h|) 



[z(b)]M- 



m+l 



71 



al. 



s=0 



CI 



(T, 



(99) 



where Dsihm) is defined in ( |1 14[ ) and is specified in ( |108[ ). For the general case of bM~m+i 
having a nonzero real and a nonzero imaginary part, i.e., &R,A/-m+i 7^ and 6i,M-m+i 7^ 0, 
the real- and imaginary parts of [z(h)]M-m+i are statistically dependent, which seems to make 



it difficult to find a closed-form expression for P ||[z(b)] 



J 



resort to upper and lower bounds. In particular, it follows from (94) that 



[z(b)], 

Hh)]M- 



m+l 



> p 
< p 



z(b)]M_^+i| <V2Q 



On can, however, 

(100) 
(101) 



The RHS expressions of ( |100[ ) and ( |101[ ) can now be expressed analytically using ([38]). Together 
with ( [98] ) and ( [99] ) this provides upper and lower bounds on P[||zfc(bfc)||55 < C55] and thus 
on ElSss fc}, k = 1, . . . , M, and ElSgs}. We do not display the resulting final expressions as 
they are rather involved and do not contribute to deepening the understanding. Corresponding 



numerical results are provided in Section VI-C2 



Following the choice of the radii for SD-/°° and SD-/^ in ( [53] ) and ( [5T| ), respectively, C55 is 
obtained by setting 



n 



2N 



1-e 



(102) 



which resuks in (cf. (p4b and (52)) 



(103) 
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C. Asymptotic Complexity Analysis 



We next show that SD-/°° with C55 chosen according to ( |103[ ) exhibits exponential complexity 
scaling in the problem size M. This will be accomplished by following the same approach as for 
SD-/°° and SD-P (see Sections III-F2 and III-F3, respectively), i.e., by developing an analytically 



tractable lower bound on Ej^gs} and then establishing that this bound scales exponentially in 
M. For the sake of simplicity of exposition, we set L = — M = in the remainder of this 
section. 

The approach we take is to lower bound the complexity of SD-/°° by the complexity of SD-/°° 
with a suitably scaled radius. Once this is accomplished, exponential complexity scaling of SD-/°° 
can be established by straightforward modifications of the key steps in the corresponding proof 
for the SD-/°° case. We start by applying (|94]) to get P[||n||s; < C^] < P[||n||oo < V^C^], 



which, together with ( |102| ), results in 

PO|n||oo < v^Css] > 1-e. 



(104) 



According to ([53]), we also have P[||n 



< 



C'oo] 



which by comparing with ( |104| ) results 

implies 



in C55 > C00/V2 for any given e. This, together with ||zfc(bfc)||55 < ||zfc(bfe)||oo, 

P[||Zfc(bfe)||55 < C55] > P[||Zfe(bfe)|U < Coo/V^ 



Hence, the complexity of SD-/°° with radius C55 is lower-bounded by the complexity of SD-/°° 
with radius Coo/ V^, where the radii C55 and Coo are related through the parameter e. It remains 
to follow the asymptotic complexity analysis of SD-l°° performed in Section III-F[ where Coo 
is now replaced by Coo /a/2. Invoking the lower bounds ([60]) and ( [6T] ) with Coo replaced by 

C00/V2 , we get 



P[||zA:(bfc)||55 < Css] > 



71 



( 



V2a2 



M 



1 + 



|t>fc||2 





M 


h(i)l 





Noting that 



we furthermore obtain 

P0|Zfc(bfc)||55<C55] > 



1 - 



M 



(105) 



M 



1 + 



M(t2 
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10 15 
SNR p 

Fig. 2. Uncoded total error probability Ve{p) as a function of SNR p for SD-/°°, SD-/°°, and SD-/^ (ML) detection for 
2 X 2, 4 X 4, and 8x8 MIMO system, respectively, and a 4-QAM symbol alphabet. 



Comparing this result with ( [62| ), we can immediately conclude, following the steps ( [62| ) - ( [66| ), 
that E{555} > /(M) with 



/(M) 



i_ 



Evidently, we have 



which implies that 



lim log (1 - V 1 - Vl -e 1=0 



oo M 



where /(M) was defined in ([66]). Finally, following the steps ( |67| ) - ([69]) establishes that the 
complexity of SD-/°° scales exponentially in the problem size M. 



VI. Numerical Results 

In this section, we provide numerical results quantifying some of our analytical findings. 
All the results in the remainder of this section are based on independently and equally likely 
transmitted data symbols. 

A. Error Probability 

We compare the uncoded error-rate performance of SD-/°° and SD-/°° to that of SD-/^ (ML) 
detection by means of Monte-Carlo simulations. Fig. |2] shows total error probabilities ^sip) as 
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Fig. 3. Total complexity E{S} as a function of e for SD-/°° and SD-l^ for a 4 x 4, 6 x 6, and 8x8 MIMO system, respectively, 
and a 4-QAM symbol alphabet at an SNR of p = 15dB. 

functions of SNR p for a 2 x 2, 4 x 4, and 8x8 MIMO system, respectively, using 4-QAM 
symbols in all three cases. We can observe that both SD-/°° and SD-l°° achieve full diversity 
order and show near-ML performance. Indeed, SD-/°° and SD-l°° perform much better than 
suggested by the corresponding upper bounds on the SNR gap (i.e., \A\/3 with /? given by ( |23| ) 
for SD-/°° and \A\/3 with (3 given by (|23]) with the factor replaced by V2N for SD-/°°). 
Consistent with the -\/2-difference in the upper bounds on the corresponding SNR gaps, we can 
observe that SD-l°° performs slightly worse than SD-/°°. Finally, the results in Fig. |2]show that 
the performance loss incurred by SD-l°° and SD-l°° increases for increasing M = N. 



B. Complexity 

Next we consider the complexity of SD-l°° and SD-Z^ for the case of fixed radii Coo and 



C2 chosen according to (p4|) and (52), respectively, with the same value of e in both cases (for 



numerical results on the complexity of SD-l°°, we refer to Section VI-C). The total complexity 
E{S} as a function of e for SD-/°° (see (|32]) with Mh) and SD-P (see ([50]) with (|49])) is shown 



in Fig. |3] for a 4 x 4, 6 x 6, and 8x8 MIMO system, respectively, operating at an SNR of 
p = 15dB. The following conclusions can be drawn from these results: 

• For a given e, the complexity of SD-/°° can be higher or lower than that of SD-P. 

• SD-/°° exhibits a lower complexity than SD-l^ for larger values of e, while for smaller 
values of e, SD-l°° has a higher complexity than SD-/^. This behavior was indicated by the 
high SNR-analysis of the TPB of SD-/°° and SD-l^ (in particular, see the discussion on the 
two extreme cases e ^ and e ^ 1 in Section |IV-A2|). 
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Fig. 4. A{fh{hk)) as a function of m(bfe) = 1, . . . , fc (including tiie corresponding lower bound l/(m(bfc)!)) and tlie RHS 
of ( [79} given by l/(fc!) Pc as a function of fc = 1, . . . , M for e = 10^^ and e = 10^^, respectively, for a 6 x 6 MIMO system. 

• The complexity savings of SD-/°° over SD-Z^ for values of e close to 1 are more pronounced 
for increasing M = N. 

• In practice, e is matched to the target error rate of the system (see the discussion in Section 



III-E). In the present example, we operate at 15dB SNR and the corresponding target error 
rates can be inferred from Fig. |2} which results in e values (target error rates) for which 
SD-/°° has a lower complexity than SD-/^ (cf. Fig. [3]). For the 8x8 system, for example, 
our target error rate at 15dB SNR, according to Fig. |2} is around 10^^. For this case, the 
complexity savings of SD-/°° as compared to SD-/^ are around 25% according to Fig. |3}. 



C. Tree Pruning Behavior 



Next, we quantify some of the results on the average TPB reported in Section IV-Al 
Specifically, we consider a 6 x 6 MIMO system with the radii C2, C^o, and C55 chosen according 
to (|52]), (|54]), and ( [T03] ), respectively, for e = lO'^ and e = 10"^ 

1) High-SNR Results: Fig. |4] shows y4(m(bfc)) in f72] ) as a function of m(bfc) (including the 
corresponding lower bound l/(m(bfc)!)); we also display l/{k\) as a function of k. Recall 
that the high-SNR average pruning probability of a node 7^ for SD-/°° as compared to 
SD-/^ is entirely described by the two functions A{m{hk)), rh{hk) = l,...,k, and l/{k\) p^. 



k = 1,. . . ,M (see ( |78| ) - ([81])). Hence, from Fig. |4] one now can directly infer the high- 
SNR average TPB of SD-/°° as compared to that of SD-P for every node b^, k = 1, . . . , M. 
Considering the case e = 10^^ in Fig. g one can, for example, observe that, in the high-SNR 
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Fig. 5. Complexity E{5fe} as a function of the tree level k for SD-/°°, 80-?°°, and SD-Z^ with (a) e = 10"^ and (b) e = 10'^ 
for a 6 X 6 MIMO system at an SNR of p = 15dB using 4-QAM modulation. For SD-Z°° upper and lower bounds are shown 
(see Section V-B i. Fig. [3] shows the corresponding complexity results for SD-Z°^ and SD-Z'^. 



regime, at tree level k = A SD-/^ prunes all nodes with m(bfc) = 1,2 with higher probability 
than SD-/°° (and vice-versa) or that SD-/°° prunes all nodes hk up to tree level k = 2 with 
higher probability than SD-/^. Furthermore, the following general conclusions can be drawn: 

• For the two considered e-values, the function A(^m{hk)) is close to the lower bound 
l/(m(bfe)!). 

• The function A(^m{hk)) decreases in m(bfc). Therefore, SD-/°° prunes nodes that correspond 
to a first symbol error at high tree levels, i.e, nodes with large m(bfc), in general, with 
higher probability (in the high-SNR regime) than those that correspond to a first symbol 
error at low tree levels, i.e., nodes with small m(bfc) (provided that ||bfc||2 is constant in 
this comparison). 

• The function A(^m(hk)) increases by going from e = 10"^ to e = 10~^ for a given m(bfe) > 



1 (see also Appendix F-B showing that A(m(bjt)) is a nondecreasing function of e). 

2) Complexity Versus Tree Level and Complexity Bounds for SD-l°°: The goal of this section 
is to quantify the level-wise complexities Ej^fc} for SD-/°°, SD-/°°, and SD-/^, as well as 
to illustrate the quality of the upper and lower bounds on the complexity of SD-/°° reported in 
Section V-B Note that for the cases of SD-/°° and SD-P exact complexity expressions according 



to ( [48] ) and ( |49l ), respectively, are available. Fig. |5] shows Ej^fc} as a function of the tree level 
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k for SD-Z^ and for SD-/°° including the corresponding upper and lower bounds on ElSfe} for 
SD-P at an SNR of p = 15dB (Fig.lsfa) for e = lO^^ and Fig.|5|;b) for e = IQ-^). The following 
conclusions can be drawn from these results: 

• At tree levels close to the root (i.e., for small k), SD-/°° (SD-/°°) visits fewer nodes than 
SD-P on average; at tree levels close to the leaves this behavior is reversed. This observation 



is supported by the results on the average TPB reported in Section IV (in particular, see 
( [83] ) and the discussion in the last paragraph of Section [lV-A2[ ). 



The complexity savings of SD-l°° (SD-l°°) over SD-P close to the root extend to higher 
tree levels for the larger e value of 10^^. This behavior is consistent with the average TPB 



analysis in Section IV stating that EISqca,} ^ E{S'2,fc}, p — oo, up to tree level k, where k 



was shown to be a nondecreasing function of e (see Section IV-A2). For example, we have 
= 3 for e = 10-2, while = 2 for e = 10"^ (see also Fig. Q. 

• For e = 10^2, the complexity savings of SD-/°° at tree levels close to the root are dominant 
enough to result in a smaller total complexity of SD-/°° as compared to the complexity 
of SD-/2 (cf. Fig. |3]). For e = 10^^, however, the increased complexity of SD-/°° at tree 
levels close to the leaves outweighs the savings close to the root resulting in higher total 
complexity of SD-/°° when compared to the complexity of SD-P (cf. Fig. [s]). 

• The upper and lower bounds on the complexity of SD-l°° are sufficiently tight to capture 
the essential aspects of the level-wise complexity of SD-/°° since they both show the same 
behavior over the tree levels; as for SD-l°°, we can again observe complexity savings of 
SD-/°° over SD-P close to the root, whereas this behavior is reversed at tree levels close to 
the leaves. Furthermore, for the examples considered, the lower bounds on the complexity 
of SD-l°° show that SD-/°° has a higher total complexity than SD-/°° (see also next Section). 

D. Complexity of Sphere-Decoding with Restarting 



As already mentioned in Section III-E, to guarantee ML or exact SD-Z°° performance the 
corresponding SD algorithm has to be restarted with an increased radius in cases where the 
initial radius was chosen too small for the search sphere (or box) to contain a valid leaf node. 
The same is, of course, true for SD-l°°. To evaluate the overall (across potential multiple SD 
runs) complexity of SD-P, SD-/°°, and SD-l°° we choose an increasing radii schedule obtained 
by setting e = 0.1*, i = 1, 2, . . . , in the ith run of the SD. Corresponding average (w.r.t. channel, 
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Fig. 6. Total complexity versus SNR p for SD-l°°, SD-l°°, and SD-Z^, all with restarting for a 4 x 4, 6 x 6, and 8 x 8 MIMO 
system (for the e-schedule see text), using (a) 4-QAM modulation and (b) 16-QAM modulation. 

noise, and data) complexity results for 4 x 4, 6 x 6, and 8x8 MIMO systems using 4-QAM and 
16-QAM modulation obtained through Monte-Carlo simulations can be found in Fig. [6] We note 
that analytical expressions for the overall complexity of SD (with any norm considered here) 
with restarting are not available since the statistics of the corresponding required number of SD 
runs seem to be difficult to obtain. From Fig. |6] we can observe that in the relevant SNR regime 
(e.g., about lOdB to 15dB for the 4-QAM case corresponding to error probabilities of about 
10"^ to 10"^ cf. Fig. 11) SD-/°° and SD-/°° exhibit lower complexity than SD-P. For example, 
at 12.5dB, we can infer from Fig. [SJ^a) that the corresponding complexity savings of SD-l°° and 
SD-l°° over SD-P are about 30%. Furthermore, it can be observed that the complexity savings of 
SD-/°° and SD-/°° over SD-P are more pronounced for increasing M = N. We finally emphasize 
that these computational (algorithmic) complexity savings of SD-/°° over SD-/^ go along with a 
significant reduction in the circuit complexity for metric computation [7] (see the discussion in 
Section I-A). Indeed, the overall (circuit and algorithmic) complexity of SD-/°° is up to a factor 
of 5 lower than the overall complexity of SD-P. 



VII. Conclusions 

We analyzed sphere-decoding (SD) based on the /°°-norm and provided theoretical underpin- 
ning for the observations reported in [7]. The significance of /°°-norm SD is supported by the 
fact that its overall implementation complexity in hardware is up to a factor of 5 lower than that 
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for SD based on the Z^-norm (corresponding to optimum detection). In particular, we found that 
using the Z°°-norm instead of the Z^-norm does not result in a reduction of diversity order while 
leading to an SNR gap, compared to optimum performance, that increases at most linearly in the 
number of receive antennas. We furthermore showed that for many cases of practical interest 
norm SD, besides having a smaller circuit complexity for metric computation (thanks to the fact 
that it avoids squaring operations) also exhibits smaller computational (algorithmic) complexity 
(in terms of the number of nodes visited in the search tree) than /^-norm SD. The computational 
complexity of /°°-norm SD was found to scale exponentially in the number of transmit antennas 
as is also the case for /^-norm SD. 

Besides the /°°-norm, VLSI implementations are often based on the /^-norm (which does not 
require squaring operations either). The tools developed in this paper could turn out useful in 
analyzing the performance of SD based on the /^-norm as well. From a computational complexity 
point-of-view, however, the results in [7] suggest that /°°-norm SD is more attractive than l^- 
norm SD. More generally, it would be interesting to understand the impact of Z^-norm (sphere) 
decoding with general p and to investigate this impact for other channel models (such as ISI- 
channels, for example). 
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Appendix A 



Calculation of P 



M-m+l 



In the following, we derive ([38]). We start by introducing the RVs 

M 

Vm = RM-m+l,M-m+l\bM-m+l\, Um = RM-m+l,i bi + UM^rn+l- (106) 

i=M-m+2 

Since the nonzero entries in R and the entries in n are all statistically independent, Vm and u 



m 



are statistically independent as well. Here, Vm is a X2(m+L) -distributed RV with pdf (cf. ([T])) 

T{m + L)\bM-m+ir"'^^> 
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The RV Um is CA/'(0, a^) distributed, where 



al = ||b,„_i||VM + a^ 



(108) 



Exploiting the circular symmetry of Um, we have 



Thus, 



|[z(b)]M-m+l| = \Vra + Um\- 



Z(b)] 

Af-m+l| ^ Coo — P[|'^m + '^ml ^ Cqo] 



'^m ~l~ ^ Coo 



(109) 



(110) 

(111) 



For given Vm = v, the RV -|-|f +MmP is non-central X2 "distributed with non-centrality parameter 
. Thus, (see [32, Corollary 1.3.5]) 



2?r 
^2 



~l~ ^ Cqo Vr 



^ N 2s 1 



(112) 



Inserting ( [TOT] ) and ( fTT2l ) into ( [TTT] ) yields 



27s+i(Ci/a^)M-+^ 



s=0 



s!a^^r(m+L)|6M-^„+i|2(-+^)^o 



^2(s+m+L) 



'M-m + ll <^m, } 



Here, the integral can easily be rewritten such that the integrand is the pdf of a X2{s+m+L)- 
distributed RV (cf. ([T])), which then yields 



1. 



M^m+i\ -n.J(iy= Y{s + m + L)\- -+ „ 

2 WOM-m+ir (^r 



M 



-{s+m+L) 



Finally, using T(a) = (a — 1)! for positive integers a, we get 

„_n V / 



where 



s+m+L — 1 
m+L — 1 



p(b^)"^+^(l-p(b^))^ 



and, as defined in ( |40| ), 

p(b^ 



|b™_i||2 + Ma2 



(113) 



(114) 
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In the remainder of this section, we show that the infinite summation in (11 131) can be avoided. 



We use pm = p(bm) to simplify notation and we start by noting that ( |1 13[ ) can be written as 



^[\Vm + Um\ < Coo] 



fm 



(m + L-iy.J, 



oo / m+L—1 



s=0 



i=l 



e~'dt (115) 



where the identity ( |124| ) for the lower incomplete Gamma function was used. With g{x) 
gX^m+L-i ^YiQ series expansion = Yl'^o ff' have that 

oo / m+L—1 



X 



s=0 \ i=l 

On the other hand, by Leibniz's law for the differentiation of products of functions, we also 
have 

I r 1 

m+L-l\ (m + L-l] 
I 



m+L-l 



1=0 



{m + L-l - I) 



gX^m+L-l- 



Thus, ( |115| ) can equivalently be written as 

m+L-l 



P[K + «m|<C.o]= 1^ ^ J r(m + L-/) io 



m+L-l-1 ^-pmt 



By substituting t' = and again using identity ( |124[ ), we finally get 

m+L-l 



^[\Vm + Um\ < Coo] = 2^ ^ J 



1=0 - ^ ^ V ^m 



which, noting that 



VM' 



concludes the derivation of (38). 



(116) 



Appendix B 
Sum Representation of | [z(b)] 



M-m+l 



In the following, we prove ( [41] ) based on the following theorem. 
Theorem [26]. Consider the RVs 



.«=,(yr\y^^...,yW), /=o,... 



(0^ 



(117) 



where yf\ i = 1, . . . , a, for every I, are statistically independent RVs with pdfs equal to fi{x) 
if i < I and f2{x) otherwise. If g{-) is a symmetric function (i.e., g{-) is unchanged by any 
permutation of its arguments), then the pdf of 

z = giyi,y2,---,ya) (118) 
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where the yi, i = 1, . . . , a, are i.i.d. with mixture pdf 

fyX^) = Pflix) + (1 -P)f2{x), 0<p<l 

is given by 

a 

with 



(119) 



(120) 



1=0 



Bi= {]W{l-p) 



M-l 



Here, f^n) (x), / = 0, . . . , a, denotes the pdf of z^'-^ specified in ( |1 17[ ). 

We apply this theorem to the case at hand by defining fi{x) = 5{x) and f2{x) = f^i{x) and 
setting a = m + L — \. Furthermore, we take g{-) as 



g[Xi,X2,...,Xm+L-l) = ^ 2^Xi 



i=l 



which implies 



fzW {x) 



2x 



(121) 



(122) 



\\hm\\l/M + a2 •'^2('"+^-i-o V||b™||2/M + (t2^ 
if / < m + L — 1 and /^(o (x) = 5{x) if / = m + L — 1 for the pdfs of the RVs z^^'^ defined in 



(117). Using (120), we thus get 



m+L-l 



fz{x) 



y" Bi L2 



1=0 



2x 



with the corresponding cdf essentially given by the RHS of ( [38] ) but with two missing degrees 
of freedom in the x^. distributed RVs underlying the individual terms in the sum. To compensate 
for these two missing degrees of freedom, we construct the RV 

(123) 



lib IP 

,2 _ I Il""-ill2 



7 



with 7^ ~ being statistically independent of z. Noting that (/^2 * /^2)(x) = /^2^^(x), we 
obtain 



m+L—l 



\hm\\l/M + 



1=0 



2x 



or, equivalently. 



m+L~l y 

P[tl<x]= J2 Bnm+L-l[ 
1=0 ^ 



+^-Al|bm||i/M + cj2 



X 



\h.n,\\l/M + a\ 

d I 1 2 

thus, by comparison with ( [38] ), establishing that = | [z(b)]M-m+i | • Finally, ( |123| ) together 
with ( fTTSj ) and ( [TIT] ) shows ( |42l ). 
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Appendix C 

Bounds on Lower Incomplete Gamma Function 
In this section, we summarize properties of the lower (regularized) incomplete Gamma function 



y"- ^dy, x,a eR, x,a>0 



(124) 



r(a) Jo 

needed in this paper. In the remainder of this section, we will furthermore assume that a G N, 
which is the most relevant case for our results. We start by noting that 7a (a;) can equivalently 
be written as [33, Sec. 6.5] 



X 



7a(x) = e " ^ 

i=a 



a-1 



i=0 



X 



(125) 



(126) 



An immediate consequence of ( |126[ ) is 7i(x) = 1 — e ^. From ( |125[ ), we can directly infer that 

laiix) > 'Jaiix), ai < fla- (127) 
Furthermore, we have [34, Eq. (5.4)] 

(l - e-^'^y < 7a(x) < (1 - e-^y. (128) 
We will also need the relation 

[7ai(a:)]^ > [7a.(a;)]^, a, < (129) 
which will be proved by showing that [7a(x)]" is a nonincreasing function of a G N, i.e., 

[7a(x)]» > [7a+l(x)]^. (130) 



The proof is by induction. For a = 1, we have 71 (a;) > [72 (a;)] 2, which follows from ( |128[ ). It 
remains to show that 

[7n(a;)]" > [7n+i(a:)]^ , n G N (131) 



implies 

[7„+i(x)]^ > [7n+2(a;)]^ . 
To this end, we use [35, Lemma 3] which states that 

7n+i(x) > [7n(a;)]5 [^„^2(a;)]2 



(132) 



(133) 
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Inserting (131 1 into ( 133), we get 



which gives 



1 n + 2 1 

[7„+i(a;)]2n+i > [7^+2 (x)]^ 



establishing ( |132[ ) thereby concluding the proof. 
We will finally show that 



la 



Xi 



I + X2 



>7a(a;i)(l + X2)-" 



for any xi,X2 > 0. Inserting into the definition (124) yields 



7a 



Xi 



1 

1 / 1+^2 



I + X2J r(a) jQ 
which, upon substituting y = (1 + X2)y, can be rewritten as 



la 



I+X2 



T(a) 



y"- e ^+^2dy. 



Since e ^+^2 > e ^ for X2 > 0, we arrive at (134). 



(134) 



Appendix D 
asymptotics of radii 



A. Asymptotics of C2 in ( |52| ) 



For fixed SNR (i.e., fixed cr^) and fixed e, the asymptotic (N — > 00) behavior of C| 



cr^7^^(l — e) can be obtained as follows. According to [34, Eq. (2.13)] 



iN+l 



{N + V2Nx) = l-Q{V2x) +0{l/y^), N ^ 00 



for xgM, 0<x<oo. Therefore, we have 



where Q-\e + 0{l/v^)) = 0{1) showing that C| ~ a^N, N ^ 00. 
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y2 



B. Asymptotics of in ( [54] ) 

For fixed SNR (i.e., fixed cr^) and fixed e, the asymptotic (N oo) behavior of C'^ 
-cT2log(l - v^r^) is obtained as follows. We have ^^1^ = 1 + 0{1)/N, N ^ oo. Thus, 

which shows that ~ (T^log(A^), ^ oo. 

Appendix E 
Asymptotic Behavior of P[||zfc(bfc)||oo < Coo] 
In the following, we characterize the asymptotic (in SNR) behavior of P[||zfc(bfc)||oo < Coo]. 



This is done by splitting the product on the RHS in ( [37] ) into three parts, which are treated 



separately (recall the definition of m(bfc) in Section IV-Al as the index of the first erroneous 
tree level and the definition of Kqo in fTO])). 

• 171 = l,...,m(bfe) — 1: We have [z{h)]M~m+i = [n]j\/-m+i, which is CA/'(0, cr^) so that 



(135) 



|[z(b)]A/-m+i| <Coo\= 1i{k.oo)- Hcncc, the first part is given by 

= [liinoo)f^'"^-'^'' 



m(bfc)-l 

[iiM]'' n pf|[z(b)]A/-™+i| <a 



m=l 



m = m(bjfc): The second part corresponds to the first erroneous tree level associated with 



bfc. Here, we start by noting that (40) yields 



Ma' 



hm\\l + Ma^ 



where we used ||bm,_i||2 = 0. We thus have 



p(bJ~(p||b„||2/M) 



-1 



p ^ oo 



and 



Furthermore, 



1 - p{hr, 



p — > OO. 



(136) 



(137) 



a 



|b„||2/M + a2 



Im+L-l 



l + p\\h„,\\l/M 



and ( |125[ ) implies that 



1m+L-l 



l + p||b™||2/M 
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C^"-' (p||b„.ll^/M) 



-(m+L-l) 



{m + L-l)\ 
With ([38]) and ([136]) - ( [T38] ), we finally arrive at 



p — > CX). 



(138) 



z(b)]M-™+i| < Coo ~ (p ||b„||^/M) 



2 



p^oo (139) 



where 



m+L-l / y 

/=0 ^ ^ 



rn+L—l 



I J {m+L-l)\ 



m = m(bfc) + 1, . . . , /c: For these tree levels, we have ||bm-i||2 7^ 0, which yields 



P(br 



a II *-'m— 1 II 



llh l|2 

II *-'m II 2 



(140) 



Combining this result with (138) and (38), we thus obtain 



M-m+l ^ C'oo 



I f"-— 1II2 



lb l|2 

I m II 2 



(p||b^||2/M) \ p^oo 



so that 



II p[|[z(b)]M-™+i| <a 

m=rn(bfe)+l 



K 



fc^a(b,) ,-(A.-^(b,)) -Q p^oo. (141) 



P 



m=m(bfc)+l 



|b„||l/M)-+^ 



Next, note that 



n 



(llb^-ii/M)'"*'-! _ (||bffi,b.,i/Af)'»<i")+'- 



m,=m(bfe) + 



, (l|b™||i/M) 



m-|-L 



|b,||2/M)'=H-^ 



Combining (135), (139), (141), and (142) finally yields (71). 



(142) 



Appendix F 
Properties of A{^m(hk)) 

A. Limit of A{m{hk)) for 

We want to prove that 



lim A[m(hk)) = 1. 

With ( |125| ), we can write 



(143) 



November 18, 2008 



DRAFT 



50 



which gives 

A(m(b,)) = (i+o(i)f(^^)+^- y: 



(m(bfc) + L-/) 



m(bfe)+L-l-« 



for Koo — > establishing ( |143[ ) 



5. Monotonicity of A{m{bk)) 

In the following, we show that 74(m(bfc)) in ( |72l ) is a nonincreasing function of k^o (or, 
equivalently, noting that k^o = — log(l — v^l — e), A{m{hk)) is a nondecreasing function of 
e). This will be done by setting x = /too, ^ = ^(bfc) + L, and by showing that 



m— 1 



1=0 



m — 1 



/ y(m-/)! 



is a nonincreasing function of x > 0, or equivalently, f'{x) < 0, for x > 0. For m = 1 this holds 
trivially as f(x) = 1. We therefore consider the case m > 2 in what follows. The condition 
f'{x) < 0, for X > 0, is equivalent to 



e 

ifn 



' 1=0 



m — 1 



m—l 



I /(m-iy. 



m—l 



1 



/ y(m-/)! 



X 



X > 0. 



(144) 



Multiplying both sides of ( |144| ) by x"* > 0, and substituting i = in — I, it remains to show that 

p{x) > g(x), for X > (145) 

where 

fh ^ 

= (e"-l)y ^^a,x*-i (146) 
m — l 

i=l 

and 

m 

g(x) = ^a,x' (147) 

i=l 

with 

/m-l\ 1 

Here, we used = C^J^^^) . Evidently, a sufficient condition for ( |145[ ) to hold is that p(0) > 



g(0) and p'(x) > q'{x), for x > 0. Successively applying this argument, ( |145[ ) can be shown by 
proving that 



P 



{n 



> = forn = 0, . . . ,m 



\x = 



(149) 
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and 



P 



(m+l). 



(150) 



Condition ( |150| ) can be verified by noting that p^^'^^\x) > for x > (cf. ( |146[ )) and 



X] 



since q{x) in ( |147[ ) is a polynomial of degree fh. It thus remains to establish 



( |149| ). Since we have ]9(0) = and g(0) = 0, it follows that ( |149[ ) is trivially satisfied for n = 0. 
It therefore remains to show ( |149| ) for n = 1, . . . , m. By Leibniz's law for the differentiation of 
products of functions, we obtain 



n 



i = n + 1, . . . ,m 



for 5f(x) = (e"^ — \ which yields 



For the RHS of ( 149) we get 



Using ( |148| ), the condition ( |149| ) can thus be rewritten as 



f ^ \ffh — 2\l fm — 1 



n — \ 



n = 1, . . . ,m. 



RHS of (151 1 can be written as 



m — l 
n — 1 



m — 2 
n-2 



m — 2 
n — 1 



(151) 



Note that (151 1 is trivially satisfied for n = 1. It thus remains to consider n = 2, . . . ,m. The 



(152) 



The proof is concluded by showing that the sum of the two terms on the left hand side of (151 1 



corresponding to i = n and i = n — 1 is greater than or equal to the RHS in ( |152[ ). A direct 
comparison shows that this is the case if 

n \ 1 / n \ 1 

- > 1 and > 1 

n — IJ n \n — 2 J n — 1 

for n = 2, . . . , m. This is now easily verified by noting that J /n = 1 and (^"2) /(^ "~ 1) — 
n/2. 
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Appendix G 
Properties of pc(e) 



Using definition fTTj ) with ( [54| ) and ( [52| ), we have 



Pc(e) 



7r^((i-e)^/^) 

by noting that 71 (x) = 1 — (see Appendix Icl). 



A. Limits of pc{^) 

1) Limit of pc(e) for t 1: We want to prove that 



liin pc(e) 



Setting X = 1 — e, this amounts to showing that 



7iv 



hm , , , , 
^-0 7fi(a;V^) 



We start by considering the numerator in ( |154[ ) and note that ( |125[ ) implies 

1 



and thus 



7iv(2/) = ^Z/^(l + o(l)), 2/^0 



= Vn^.x'/"" (1 + o(l))-\ a; ^ 0. 



Similarly, for the denominator in ( |154[ ), we obtain 

=a;i/^(l + o(l))-\ x 



which, together with ( |155[ ), establishes ( |154[ ). 

2) Limit of pc(e) for e 0: We want to prove that 

Jim pc(e) = 1. 

Again, setting x = 1 — e, this amounts to showing that 



lim , , , , 
^-1 71"^ (xV^) 



1. 



(153) 



(154) 



(155) 



(156) 



We therefore need to prove that 'jj^^{x) ~ 7^ ^(x^/^), x — > 1. Starting with the denominator in 

.r'(^"")-iog(^). 



(156), we first note that 



(157) 
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Next, we have 



and hence 

which finally yields 

1 



(1 _ (1 _ = 1 _ _(i _ + o{{l - xf), X ^ 1 



1 - 1 



X 



N 



■(l + C(l-x)), x^l 



log 



establishing that 



log! ) +log(iV) + 0(l-a;) ~log^ ^ 



1 — X 



1 — a; 



X ^ 1 



7r'(^-"")~iog(^) 



X — > 1. 



(158) 



For the numerator in ( |156[ ), we first note that lim 7Ar(x) = 1, which implies that the x — 1 

x~*co 

asymptote of the inverse function 7^^(x) can be obtained by characterizing the x — >^ oo asymptote 



of 7iv(x). It follows from (126) that 



7Ar(x) = 1 



e-V-^(l + o(l)), x^oo 



which yields 



log((iV - 1)! (1 - 7^(x))) = -X + (iV - l)log(x) + o(l), 



X — > oo 



and hence 



-X, X — s> OO. 



log((iV-l)!(l-7;v(x))) ~ 
Now setting x = 'y]^^{y), we finally get 

lN\y) ~ -log((iV - 1)! (1 - y)) ~ log 



Together with (158 1, this implies (156). 



B. Monotonicity of pcie) 



In the following, we show that /Oc(e) in ( |153[ ) is a nondecreasing function of e on the interval 
[0, 1]. This will be accomplished by setting 1 — e = 7Ar(x), x G M, x > 0, and showing that the 
function /(x) = x/g(x) with 



g(x) = -log 1 - [7jv(a;)] 
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is nonincreasing in x > 0, or equivalently 



fix) 



g{x) - g {x)x 
g^{x) 



< 0, for a; > 0. 



It thus remains to show that 



g{x) — g'{x)x < 0, for x > 0. 



(159) 



Next, we note that g{x) is convex for a; > if and only if the first-order convexity condition 
g{x) + g'{x){y — x) < g{y) holds for all x,?/ > [36, Eq. (3.2)]. This first-order convexity 

condition evaluated at y = becomes ( |159[ ) by noting that g{Q) = 0. Consequently, it is 

j_ 

sufficient to show that g{x) is a convex function for x > or, equivalently, that 1 — [jn^x)]'^ 

i_ 

is log-concave for x > 0. The function 1 — [7Ar(x)]^ is a complementary cdf, which can be 
written as 



bN{t)]^) dt 



where \ [in{,x)]'^ j denotes the corresponding pdf. Using the fact that log-concavity of a pdf 
implies that the corresponding complementary cdf is also log-concave [37, Theorem 3], it is 
sufficient to show that 



bN{x)\- 



1 

N 



bN{x)\- 



1nK.x) 



N-l 



X 



(160) 



is log-concave for x > 0. Here, we used 'y'j^{x) = e ^x^ ^ 



/T{N) (cf. ( |T24l )). The log-concavity 
(or log-convexity) of functions is preserved by the multiplication with exponentials (which 
themselves are log-convex and log-concave), by positive scaling, and by taking positive powers 
[36], i.e., e"^'f(x), bv{x), [v{x)]^, a, 6 G M, 6 > 0, is log-concave (log-convex) if v{x) is log- 
concave (log-convex). Therefore, ( |160[ ) is log-concave if x^e~^ /'^^{x) (obtained by multiplying 
the RHS of ( |160| ) by A^r(A^)e'', taking the corresponding result to the power of A^/(A^ - 1) 



followed by multiplication by e ^) is log-concave. Equivalently, ( |160| ) is log-concave for x > 
if 



h{x) 
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is log-convex for x > 0. Next, with the series expansion ( |125| ) for 7Ar(a;), we obtain 

CO j 



i=Q 



Using the series representation of the confluent hypergeometric function 



F{a,b,x) = 



i=0 



(6), z\ 



where denotes the Pochhammer symbol, i.e., (a)j = a{a + 1) ■ ■ ■ (a + i — 1), = b(b + 
1) ■ ■ ■ (b + i — 1) with (a)o = (6)o = 1, we can write 



h{x) 



F{l,N+l,x). 



With the integral representation of F{a, b, x) [33], we finally get 



h{x) 



e"* (1 - t)^-'dt. 



(161) 



m) JO 

Applying the integration property of log-convex functions [36, p. 106], which states that log- 
convexity of v{x, y) in x for each y in some set C implies log-convexity of u{x) = J^^^ v{x, y)dy, 
we can conclude that h{x) is log-convex for x > if the integrand in ( |161| ) is log-convex in 
X for each t E [0, 1]. The proof is concluded by noting that this is trivially the case as the 
integrand, for each t E [0, 1], is proportional to an exponential function (which is log-convex) 
for aU t. 

Appendix H 
Calculation of P ||[z(b)]Af-m+i||~ < Css 

In the following, we derive an analytic expression for P || [z(b)]M-m+i||~ < C*55 under 
the assumption that &M-m+i is purely real, purely imaginary, or equal to zero. The real and 
imaginary parts of [z(b)]jvf-m+i are given by 

[z(b)]R^A/„m+l = Rm -m+l,M ~m-^lbK,M -m-^l + MR^m 
[z(b)] l,M—m+l — Rm —m+l,M ~m-^lb\^M —rn+1 ~\~ U\^rn- 

Here, Um ~ CAr(0,cr^) is specified in ( |106[ ) (cr^^ is specified in ( |108[ )) and Rivi-m+i,M-m+i ^ 
M. Under the assumption that bM-m+i is purely real, purely imaginary, or equal to zero, 
[z(b)]R^Af-m+i and [z(b)]i^A/_m+i are statistically independent, which yields 



N(b)]M- 



m+1 



R,M-m+l 



[z(b)] 



l|<C5 



(162) 
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Let us first assume that bM-m+i is purely real, i.e., fe^z-m+i = &R,Af-m+i 7^ 0. Similar to 
( |109| ), we can write | [z(b)]R,A/-m+i| = \vm + MR,m| and | [z(b)]i,M-m+i| = \ui,m\, where Vm = 
RM-m+i,M-m+i\bM~m+i\ IS a scaled X2(m+L) -distributed RV with pdf ( |107D and mr,™ and mi,^ 



are i.i.d. A/'(0, (t,^/2). The RV ^|Mi,m| is thus xi -distributed, which gives 



7i 



(163) 



f, the RV + MR,mP is non-central x^-distributed with non-centrality 



For given Vm 

parameter Thus, following the steps ( |112[ ) - ( |113| ), we obtain 



(164) 



where Ds(bm) was defined in ( |114[ ). Note that the only difference between ( |164[ ) and ( |113| ) is 
the occurrence of the factor 1/2 instead of the factor 1 in the index of the incomplete Gamma 



function. As a result, however, it seems that (164) cannot be expressed as a finite sum as was 



done for ( |1 13| ) to arrive at ( |1 16| ). The final expression ( [99[ ) now follows by combining ( |162[ ), 
( |163| ), and ( |164[ ). The cases feM-m+i = ^i,A/-m+i 7^ and 6M-m+i = can be seen to result in 



'9% by following the steps ( 162) - (164) properly modified. 
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